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Table 2-1: 






Line 


Function (Now) 


Function (Should Be) 


1 


XII 


XA11 


2 


X12 


XA12 


5 


X0D2 


XD2 


20 


I0A5 


XA5 


26 


IGD23 


XD23 


27 


I0D24 


XD24 


28 


IOD25 


XD25 


28 


VSUBS 


SUBS 


29 


IOD26 


XD26 


30 


IOD27 


XD27 


31 


iOD28 


XD28 


32 


IOD29 


XD29 


33 


I0D30 


XD30 


34 


I0D31 


XD31 


35 


lORDY 


XRDY 



2-6 


Table 2-2. Insert the following at the end of the table. 


LOCATOR {1 PIN) 1 


NONE 


1 1 1 NC 1 Reserved. See Table 2-1 and Figure 2-1. 



7-9 

A-5 

A-6 

A-6 
A-7 

A-8 

A-9 

A-9 



Line 7: src should be dst. 

Table A-5: Characteristics (13), (14), (15), (16), (17), and (18) change (10) to (X) 
in name and description. 

Figure A-4: Change (IO)R/W to (X)R/W, (IO)A to (X)A, (IO)D to (X)D, and 
(lO)RDY to (X)RDY. 

Table A-6: All characteristics change (10) to (X) in name and description. 

Figure A-5: Change (IO)R/W to (X)R/W, (IO)A to (X)A, (IO)D to (X)D, and 
(lO)RDY to (X)RDY. 



Figure A-6: Ch ange lOR /W to X R/W, lOA to XA, lOD to XD, and lORDY to 
XRDY. Change (M)STRB in title to lOSTRB. 

Table A-7: Characteristics (22), (14.1), (15.1), (16.1), (17.1), and (18.1) change 10 
to X in name and description. 

Table A-8: All characteristics change 10 to X in name and description. 

The changes shown in this Manual Update will be included in the next revision of the Third-Generation 
TMS320 User's Guide. 
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Section 1 

Introduction 



The TMS320C30 (third-generation) Digital Signal Processor (DSP) is a 
high-performance CMOS 32-bit device in the TI\/IS320 family of single-chip 
digital signal processors. Since 1982 when the TMS32010 was introduced, 
the TMS320 family has established itself as the industry standard for digital 
signal processing. Powerful instruction sets, high-speed number-crunching 
capabilities, and innovative architectures have made this high-performance 
family of processors ideal for DSP applications. 

The TMS320 family consists of three generations of processors: TMS320C1 x, 
TMS320C2X, and TMS320C3x (see Figure 1 -1 ). The family has expanded to 
include enhancements of earlier generations and more powerful new gener- 
ations of digital signal processors. 



TMS320C1X 

32010 • 16/32-bHCPU 

32011 • 160-ni Initr cycl* 
320C10 • 2S6 W data RAM 
320C15 • AKWROM/EPflOM 
320E15 • 4X W axt prog main 
320C17 • 16 X 16'32-bHniult 
320E17 • 2 tarial ports 

• Companding H/W 

• Coprocataor l/F 



TM320C2X 



32020 • 
320C25 • 



16/32-btt CPU 
100-ni Inttr cycia 
644 W data RAM 
4K W prog ROM 
128KW total mam 
16 X 16 = 32blt mult 
Sarial port and tlmar 
Block mova/rapaat 
MultlprocaHOr l/F 



TMS320C3X 

320C30 • 32-blt float-pt CPU 

• 60-na Initr cycIa 

• 2K W RAM 

• 4K W ROM 

• 64 W Inatr cacha 

• 16M W total mam 

• 32 X 32 = 40-blt muH 

• 2 sarial ports 

• 2 tlmara 

• DMA 



Figure 1-1. TMS320 Device Evolution 
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Introduction 




This document discusses the third-generation device, TMS320C30, within the 
TMS320 family. The 60-ns cycle time of the TMS320C30 allows it to execute 
operations at a performance rate previously available only on a supercomputer. 
Even higher performance is gained through its large on-chip memories, con- 
current DMA controller, and instruction cache. 

his section presents the following major topics: 

General Description (Section 1 .1 on page 1 -3) 

Key Features (Section 1 .2 on page 1 -4) 

Typical Applications (Section 1 .3 on page 1 -5) 

How To Use This Manual (Section 1 .4 on page 1 -6) 

References (Section 1 .5 on page 1 -8) 
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1.1 General Description 



The TMS320's internal busing and special digital signal processing (DSP) 
instruction set provide speed and flexibility.This combination produces a pro- 
cessor family capable of executing up to 33 MFLOPS (million floating-point 
operations per second). The TMS320 family optimizes speed by implementing 
functions in hardware that other processors implement through software or 
microcode. This hardware-intensive approach provides the design engineer 
with power previously unavailable on a single chip. 

The TMS320C30, the third-generation device in the TMS320 family, can 
perform parallel multiply and ALU operations on integer or floating-point data 
in a single cycle. The processor also possesses a general-purpose register file, 
program cache, dedicated auxiliary register arithmetic units (ARAL)), internal 
dual-access memories, one DMA channel supporting concurrent I/O, and a 
short machine-cycle time. High performance and ease of use are achieved 
through greater parallelism, greater accuracy, and general-purpose features. 

General-purpose applications are greatly enhanced by the large address space, 
multiprocessor interface, internally and externally generated wait states, two 
timers, two serial ports, and multiple interrupt structure. The TMS320C30 
supports a wide variety of system applications from host processor to dedi- 
cated coprocessor. 

The emphasis on total system cost has resulted in a less-expensive processor 
that can be designed into systems currently using costly bit-slice processors. 
High-level language is more easily supported through a register-based archi- 
tecture, large address space, powerful addressing modes, flexible instruction 
set, and support of floating-point arithmetic. 
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1.2 Key Features 

Some key features of the TMS320C30 are listed below. 

• 60-ns single-cycle instruction execution time 

- 33.3 MFLOPS (million floating-point operations per second) 

- 16.7 MIPS (million instructions per second) 
One 4K x 32-bit single-cycle dual-access on-chip ROM block 
Two 1 K X 32-bit single-cycle dual-access on-chip RAM blocks 
64 X 32-bit instruction cache 
32-bit instruction and data words, 24-bit addresses 
40/32-bit floating-point/integer multiplier and ALU 
32- bit barrel shifter 

Eight extended-precision registers (accumulators) 
Two address generators with eight auxiliary registers and two auxiliary 
register arithmetic units 

On-chip Direct Memory Access (DMA) controller for concurrent I/O and 
CPU operation 

Integer, floating-point, and logical operations 

Two- and three-operand instructions 

Parallel ALU and multiplier instructions in a single cycle 

Block repeat capability 

Zero-overhead loops with single-cycle branches 

Conditional calls and returns 

Interlocked instructions for multiprocessing support 

Two serial ports to support 8/16/32-bit transfers 

Two 32- bit timers 

Two general-purpose external flags, four external interrupts 

180-pin grid array (PGA) package; ^ [im CMOS 
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1.3 Typical Applications 



The TMS320 family's unique versatility and realtime performance offer flexible 
design approaches in a variety of applications. In addition, TMS320 devices 
can simultaneously provide the multiple functions often required in those 
complex applications. Table 1 -1 lists typical TMS320 family applications. 

Table 1 -1 . Typical Applications of the TMS320 Family 




GENERAL-PURPOSE DSP 


GRAPHICS/IMAGING 


INSTRUMENTATION 


Digital Filtering 
Convolution 
Correlation 
Hilbert Transforms 
Fast Fourier Transforms 
Adaptive Filtering 
Windowing 
Waveform Generation 


3-D Rotation 
Robot Vision 
Image Transmission/ 
Compression 
Pattern Recognition 
Image Enhancement 
Homomorphic Processing 
Workstations 
Animation/Digital Map 


Spectrum Analysis 
Function Generation 
Pattern Matching 
Seismic Processing 
Transient Analysis 
Digital Filtering 
Phase- Locked Loops 


VOICE/SPEECH 


CONTROL 


MILITARY 


Voice Mail 
Speech Vocoding 
Speech Recognition 
Speaker Verification 
Speech Enhancement 
Speech Synthesis 
Text-to-Speech 
Neural Networks 


Disk Control 
Servo Control 
Robot Control 
Laser Printer Control 
Engine Control 
Motor Control 
Kalman Filtering 


Secure Communications 
Radar Processing 
Sonar Processing 
Image Processing 
Navigation 
Missile Guidance 
Radio Frequency Modems 
Sensor Fusion 


TELECOMMUNICATIONS 


AUTOMOTIVE 


Echo Cancellation FAX 
ADPCM Transcoders Cellular Telephones 
Digital PBXs Speaker Phones 
Line Repeaters Digital Speech 
Channel Multiplexing Interpolation (DSI) 
1 200 to 1 9200- bps Modems X.25 Packet Switching 
Adaptive Equalizers Video Conferencing 
DTMF Encoding/Decoding Spread Spectrum 
Data Encryption Communications 


Engine Control 
Vibration Analysis 
Antiskid Brakes 
Adaptive Ride Control 
Global Positioning 
Navigation 
Voice Commands 
Digital Radio 
Cellular Telephones 


CONSUMER 


INDUSTRIAL 


MEDICAL 


Radar Detectors 
Power Tools 
Digital Audio/TV 
Music Synthesizer 
Toys and Games 
Solid-State Answering 
Machines 


Robotics 
Numeric Control 
Security Access 
Power Line Monitors 
Visual Inspection 
Lathe Control 
CAM 


Hearing Aids 
Patient Monitoring 
Ultrasound Equipment 
Diagnostic Tools 
Prosthetics 
Fetal Monitors 
MR Imaging 
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1.4 How To Use This Manual 



The purpose of this user's guide is to serve as a reference book for the 
TMS320C30 digital signal processor. This document is designed to provide 
information that assists managers and hardware/software engineers in appli- 
cation development. The first group of sections provides specific information 
about the architecture and hardware operation of the device. Later sections 
describe the software operation. Specific software and hardware applications 
are provided in Sections 1 2 and 1 3, respectively. Electrical specifications and 
mechanical data can be found in the data sheet (Appendix A). 

The following table lists each section and briefly describes the section con- 
tents. 

Section 2. Pinout and Signal Descriptions. Drawing of the PGA 

package for the TMS320C30. Functional listing of the 
signals, their pin locations, and descriptions. 

Section 3. Architectural Overview. Functional block diagram. 

TMS320C30 design description, hardware components, 
and device operation. Instruction set summary. 

Section 4. CPU Registers, Memory, and Cache. Description of the 

registers in the CPU register file. Memory maps provided 
and instruction cache architecture, algorithm, and control 
bits explained. 

Section 5. Data Formats and Floating-Point Operations. Description 

of signed and unsigned integer and floating-point formats. 
Discussion of floating-point multiplication, addition, sub- 
traction, normalization, rounding, and conversions. 

Section 6. Addressing. Operation, encoding, and implementation of 

addressing modes. Format descriptions. System stack 
management. 

Section 7. Program Flow Control. Software control of program flow 

with repeat modes and branching. Interlocked operations. 
Reset and interrupts. 

Section 8. External Bus Operation. Description of primary and expan- 

sion interfaces. External interface timing diagrams. Pro- 
grammable wait-states and bank switching. 

Section 9. Peripherals. Description of the DMA controller, timers, and 

serial ports. 

Section 10. Pipeline Operation. Discussion of the pipelining of oper- 

ations on the TMS320C30. 

Section 11. Assembly Language Instructions. Functional listing of in- 

structions. Condition codes defined. Alphabetized indi- 
vidual instruction descriptions with examples. 

Section 12. Software Applications. Software application examples for 

the use of various TMS320C30 instruction set features. 
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Section 13. Hardware Applications. Hardware design techniques and 

application examples for interfacing to memories, periph- 
erals, or other microcomputers/microprocessors. 

Four appendices are included to provide additional information. 

Appendix A. TMS320C30 Data Sheet. Electrical specifications, timing, 
and mechanical data. 

Appendix B. Development Support/Part Order Information. Listings of 

the hardware and software available to support the 
TMS320C30 device. 

Appendix C. Instruction Opcodes. List of the opcode fields for all the 
TMS320C30 instructions. 

Appendix D. Quality and Reliability. Discussion of Texas Instruments 
quality and reliability criteria for evaluating performance. 
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1 .5 References 
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operations, and applications of digital signal processing. These books also 
provide other references to many useful technical papers. The reference list is 
organized into categories of general DSP, speech, image processing, and di- 
gital control theory, and alphabetized by author. 
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McGraw-Hill Company, Inc., 1979. 
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York, NY: McGraw-Hill Company, Inc., 1969. 

Hamming, R.W., Digital Filters. Englewood Cliffs, NJ: Prentice- Hall, 
Inc., 1977. 

IEEE ASSP DSP Committee (Editor), Programs for Digital Signal Pro- 
cessing. New York, NY: IEEE Press, 1979. 

Jackson, Leiand B., Digital Filters and Signal Processing. Hingham, MA: 
Kluwer Academic Publishers, 1986. 

Jones, D.L. and Parks, T.W., A Digital Signal Processing Laboratory 
Using the TMS32010. Englewood Cliffs, NJ: Prentice- Hall, Inc., 1987. 

Lim, Jae and Oppenheim, Alan V. (Editors), Advanced Topics in Signal 
Processing. Englewood Cliffs, NJ: Prentice- Hall, Inc., 1988. 

Morris, L. Robert, Digital Signal Processing Software. Ottawa, Canada: 
Carleton University, 1983. 

Oppenheim, Alan V. (Editor), Applications of Digital Signal Processing. 
Englewood Cliffs, NJ: Prentice- Hall, Inc., 1978. 

Oppenheim, Alan V. and Schafer, R.W., Digital Signal Processing. En- 
glewood Cliffs, NJ: Prentice- Hall, Inc., 1976. 

Oppenheim, Alan V. and Willsky, A.N. with Young, I.T., Signals and 
Systems. Englewood Cliffs, NJ: Prentice-Hall, Inc., 1983. 

Parks, T.W. and Burrus, C.S., Digital Filter Design. New York, NY: John 
Wiley and Sons, Inc., 1987. 

Rabiner, Lawrence R., Gold and Bernard Theory and Application of 
Digital Signal Processing. Englewood Cliffs, NJ: Prentice- Hall, Inc., 
1975. 
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Treichler, J.R., Johnson, Jr., C.R., and Larimore, M.G., Theory and De- 
sign of Adaptive Filters. New York, NY: John Wiley and Sons, Inc., 
1987. 

Speech: 

Gray, A.H. and Markel, J.D., Linear Prediction of Speecti. New York, 
NY: Springer-Verlag, 1976. 

Jayant, N.S. and Noll, Peter, Digital Coding of Wavefornns. Englewood 
Cliffs, NJ: Prentice-Hall, Inc., 1984. 

Papamichalis, Panos, Practical Approaches to Speech Coding. Engle- 
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Rabiner, Lawrence R. and Schafer, R.W., Digital Processing of Speech 
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image Processing: 

Andrews, H.C. and Hunt, B.R., Digital Image Restoration. Englewood 
Cliffs, NJ: Prentice-Hall, Inc., 1977. 

Gonzales, Rafael C. and Wintz, Paul, Digital Image Processing. Reading, 
MA: Addison-Wesley Publishing Company, Inc., 1977. 

Pratt, William K., Digital Image Processing. New York, NY: John Wiley 
and Sons, 1978. 

Digital Control Theory: 

Jacquot, R., Modern Digital Control Systems. New York, NY: Marcel 
Dekker, Inc., 1 981 . 

Katz, P., Digital Control Using Microprocessors. Englewood Cliffs, NJ: 
Prentice- Hall, Inc., 1981. 

Kuo, B.C., Digital Control Systems. New York, NY: Holt, Reinholt and 
Winston, Inc., 1980. 

Moroney, P., Issues in the Implementation of Digital Feedback Com- 
pensators. Cambridge, MA: The MIT Press, 1983. 

Phillips, C. and Nagle, H., Digital Control System Analysis and Design. 
Englewood Cliffs, NJ: Prentice- Hall, Inc., 1984. 
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Section 2 

Pinout and Signal Descriptions 



The TMS320C30 (third-generation TIVIS320) digital signal processor is 
available in a 180-pin grid array (PGA) package. The pinout of this package 
(Figure 2-1), and a functional listing of the signals, pin locations, and de- 
scriptions are provided in this section. Electrical specifications and mechanical 
data are given in the data sheet (Appendix A). 





1 23456789 101112131415 


A 
B 
C 
D 
E 
F 
G 
H 
J 
K 
L 
M 
N 

P 


• •••• • •••• 


Q 





Figure 2-1. TMS320C30 Pin Assignments 



2-1 



Pinout and Signal Descriptions 



Table 2-1. TMS320C30 Pin Function Assignments 




NOTE; 
1) 

2) 
3) 



Function 


Pin 


Function 


Pin 


Function 


Pin 


Function 


Pin 


Function 


Pin 


AO 


F15 


EMUO 


F14 


D19 


A9 


lACK 


G1 


XII 


D14 


A1 


G12 


EMU1 


El 5 


D20 


89 


INTO 


H2 


XI 2 


El 3 


A2 


G13 


CLKRO 


N4 


D21 


C9 


INT1 


HI 


XDO 


Q4 


A3 


G14 


CLKRl 


L4 


D22 


A10 


INT2 


J1 


XD1 


P5 


A4 


G15 


CLKXO 


M5 


D23 


D9 


INT3 


J2 


X0D2 


N6 


A5 


H15 


CLKX1 


N2 


D24 


810 


RSVO 


J3 


XD3 


05 


A6 


H14 


DO 


C4 


D25 


All 


RSV1 


J4 


XD4 


P6 


A7 


J15 


D1 


D5 


D26 


CIO 


RSV2 


K1 


XD5 


M7 


A8 


J14 


D2 


A2 


D27 


811 


RSV3 


K2 


XD6 


06 


A9 


J13 


D3 


A3 


D2B 


A12 


RSV4 


LI 


XD7 


N7 


A10 


K15 


D4 


84 


D29 


D10 


RSV5 


K3 


XDB 


P7 


All 


J12 


D5 


C5 


D30 


C11 


RSV7 


K4 


XD9 


07 


A12 


K14 


D6 


D6 


D31 


812 


RSV9 


L3 


XD10 


P8 


A13 


LI 5 


D7 


A4 


DRO 


01 


RSV10 


M2 


XD11 


OB 


A14 


K13 


DB 


85 


DR1 


N1 


XAO 


A13 


XD12 


09 


A15 


LI 4 


D9 


C6 


DXO 


03 


XA1 


A14 


XD13 


P9 


A16 


Ml 5 


D10 


A5 


DX1 


P2 


XA2 


D11 


XD14 


N9 


A17 


K12 


D11 


86 


FSRO 


P3 


XA3 


CI 2 


XD15 


O10 


A18 


LI 3 


D12 


D7 


FSR1 


M3 


XA4 


813 


XD16 


M9 


A19 


M14 


D13 


A6 


FSXO 


02 


I0A5 


A15 


XD17 


P10 


A20 


N15 


D14 


C7 


FSX1 


PI 


XA6 


815 


XD1B 


Oil 


A21 


M13 


D15 


87 


HI 


83 


XA7 


CI 4 


XD19 


N10 


A22 


L12 


D16 


A7 


H3 


A1 


XA8 


El 2 


XD20 


P11 


A23 


N14 


D17 


A8 


HOLD 


F3 


XA9 


D13 


XD21 


012 


EMU5 


CI 


DIB 


88 


HOLDA 


E2 


XA10 


CI 5 


XD22 


M10 


IOD23 


Nil 


LOCATOR 


E5 


TCLK1 


N5 


ADVDD 


D12 


VSS 


NB 


IOD24 


PI 2 


EMU4 


F12 


V8BP 


D3 


ADVDD 


H11 






IOD25 


Q13 


MC/MP 


D15 


VSU8S 


E4 


DDVDD 


D4 


CVSS 


82 


IOD26 


Q14 


MSTRB 


E3 


XI 


C2 


DDVDD 


EB 


CVSS 


P14 


IOD27 


M11 


EMU6 


M6 


X2 


81 


lODVDD 


LB 


DVSS 


C3 


IOD28 


N12 


RDY 


E1 


XFO 


G2 


lODVDD 


M12 


DVSS 


CI 3 


IOD29 


PI 3 


RESET 


F1 


XF1 


G3 


MDVDD 


H5 


DVSS 


N3 


IOD30 


Q15 


R/W 


G4 






PDVDD 


M4 


DVSS 


N13 


I0D31 


PI 5 


EMU2 


F13 


VDD 


DB 






IVSS 


814 


lORDY 


D2 


EMU3 


El 4 


VDD 


H4 


VSS 


C8 






lOR/W 


D1 


STRB 


F2 


VDD 


H12 


VSS 


H3 


RSV6 


L2 


lOSTRB 


F4 


TCLKO 


P4 


VDD 


MB 


VSS 


H13 


RSV8 


Ml 



ADVDD, DDVDD, lODVDD, MDVDD, and PDVDD pins (D4, D12, EB, H5, H11, LB, M4, and M12) 
are on a common plane internal to the device. 

VDD pins (DB, H4, HI 2, and MB) are on a common plane internal to the device. 

VSS, CVSS, and INSS pins (82, 814, C8, H3, HI 3, NB, and PI 4) are on a common plane internal 
to the device. 



4) DVSS pins (C3, CI 3, N3, and N13) are on a common plane internal to the device. 
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2.1 Signal Descriptions 



The signal descriptions for the TMS320C30 device in the microprocessor 
mode are provided in this section. Table 2-2 lists each signal, the number of 
pins, function, and operating mode(s), i.e., input, output, or high-impedance 
state as indicated by I, 0, or Z. All pins labell ed 'NC are not to be connected 
by the user. A line over a signal name (e.g., RESET) indicates that the signal 
is active low true at a logic '0' level. The signals in Table 2-2 are grouped 
according to function. 

Table 2-2. TMS320C30 Signal Descriptions 




SIGNAL 


#PINS 


i/o/zt 


DESCRIPTION 


PRIMARY BUS INTERFACE (61 PINS) 


D(31-0) 


32 


i/O/Z 


32-bit data port of the primary bus interface. 


A (23-0) 


24 


0/Z 


24-bit address port of the primary bus interface. 


R/W 


1 


0/Z 


Read/write signal for primary bus interface. This pin is high 
when a read is performed and low when a write is performed 
over the parallel interface. 


STRB 


1 


0/Z 


External access strobe for the primary bus interface. 


RDY 


1 


1 


Ready signal. This pin indicates that the external device is 
prepared for a primary bus interface transaction to complete. 
As long as RDV is a logic high, the data and address buses 
of the primary bus interface remain valid. 


HOLD 


1 


1 


Hold signal for primary bus interface. When HOLD is a logic 
low, any ongoing transaction is completed. The A(23-0), 
D(31-0), STRB,, and R/W signals are placed in a high-im- 
pedance state, and all transactions over the primary bus in- 


terface are held until HOLD becomes a logic high. 


HOLDA 


1 





Hold acknowledge signal for primary bus interface. This 
signal is generated in response to a logic low on HOLD. It 
signals that A(23-0), D(31-0), STRB, and R/W are placed 
in a high-impedance state and ail transactions over the bus 
will be held. HOLDA will be high in response to a logic 
high of HOLD. 


EXPANSION BUS INTERFACE (49 PINS) 


XD (31-0) 


32 


I/O/Z 


32-bit data port of the expansion bus interface. 


XA(12-0) 


13 


0/Z 


13-bit address port of the expansion bus interface. 


XR/W 


1 


0/Z 


Read/write signal for expansion bus interface. When a read 
is performed, this pin is held high; when a write is per- 
formed, this pin is low. 


MSTRB 


1 


0/Z 


Externa! memory access strobe for the expansion bus inter- 
face. 


lOSTRB 


1 


0/Z 


External I/O access strobe for the expansion bus interface. 


XRDY 


1 


1 


Ready signal. This pin indicates that the external device is 
prepared for an expansion bus interface transaction to 
complete. As long as XRDY is high, the data and address 
buses of the expansion bus interface remain valid. 



f input. Output, High-impedance state. 
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Table 2-2. TMS320C30 Signal Descriptions (Continued) 



SIGNAL 


#PINS 


i/o/zt 


DESCRIPTION 




CONTROL SIGNALS (9 PINS) 


RESET 


1 


1 


Reset. When this pin is a logic low, the device is placed in 
the reset condition. When reset becomes a logic high, exe- 
cution begins from the location specified by the reset vector. 


INT{3-0) 


4 


1 


External interrupts. 


lACK 


1 





Interrupt acknowledge signal. lACK goes low during exe- 
cution of an lACK instruction. This can be used to indicate 
the beginning or end of an interrupt service routine. 


MC/MP 


1 


1 


Microcomputer/microprocessor mode pin. 


XF(I-O) 


2 


1/0 


External flag pins. These pins are formatted as I/O through 
a program instruction, and latched internally when used as 
output pins. They are used as general-purpose I/O pins or 
to support interlocked processor instructions. 


SERIAL PORT SIGNALS (6 PINS) 


CLKXO 




I/O 


Serial port transmit clock. This pin serves as the serial shift 
clock for the serial port transmitter. 


DXO 




0/Z 


Data transmit output. Serial port transmits serial data on 
this pin. 


FSXO 




I/O 


Frame synchronization pulse for transmit. The FSXO pulse 
initiates the transmit data process over pin DXO. 


CLKRO 




I/O 


Serial port receive clock.This pin serves as the serial shift 
clock for the serial port receiver. 


DRO 




1 


Data receive. Serial port receives serial data via the DRO 
pin. 


FSRO 




1 


Frame sychronization pulse for receive. The FSRO pulse ini- 
tiates the receive data process over DRO. 


SERIAL PORT 1 SIGNALS (6 PINS) 


CLKX1 




I/O 


Serial port 1 transmit clock. This pin serves as the serial shift 
clock for the serial port 1 transmitter. 


DX1 




0/Z 


Data transmit output. Serial port 1 transmits serial data on 
this pin. 


FSX1 




I/O 


Frame synchronization pulse for transmit. The FSX1 pulse 
initiates the transmit data process over pin DX1 . 


CLKR1 




I/O 


Serial port 1 receive clock. This pin serves as the serial shift 
clock for the serial port 1 receiver. 


DR1 




1 


Data receive. Serial port 1 receives serial data via the DR1 
pin. 


FSR1 




1 


Frame sychronization pulse for receive. The FSR1 pulse ini- 
tiates the receive data process over DR1 . 



t Input, Output, High-impedance state. 
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Table 2-2. TMS320C30 Signal Descriptions (Continued) 



SIGNAL 


#PINS 


i/o/zt 


DESCRIPTION 


TIMER SIGNALS (1 PIN) 


TCLKO 


1 


I/O 


Timer clock. As an input, TCLKO is used by timer to count 
external pulses. As an output pin, TCLKO outputs pulses 
generated by timer 0. 


TIMER 1 SIGNALS (1 PIN) 


TCLK1 


1 


I/O 


Timer clock. As an input, TCLK1 is used by timer 1 to count 
external pulses. As an output pin, TCLK1 outputs pulses 
generated by timer 1 . 


SUPPLY AND OSCILLATOR SIGNALS (29 PINS) 


Vdd(3-0) 


4 


1 


Four +5 V supply pins. 


IODVdd(I.O) 


2 


1 


Two +5 V supply pins. 


advdd(i.O) 


2 


1 


Two +5 V supply pins. 


PDVdd 


1 


1 


One +5 V supply pin. 


DDVdd(I.O) 


2 


1 


Two +5 V supply pins. 


MDVdd 


1 


1 


One +5 V supply pin. 


Vss(3-0) 


4 


1 


Four ground pins. 


DVss(3-0) 


4 


1 


Four ground pins. 


CVssd.O) 


2 


1 


Two ground pins. 


ivss 


1 


1 


One ground pin. 


Vbbp 


1 


NC 


Vbb pump oscillator output. 


SUBS 


1 


1 


Substrate pin. Tie to ground. 


XI 


1 





Output pin from the internal oscillator for the crystal. If a 
crystal is not used, this pin should be left unconnected. 


X2/CLKIN 


1 


1 


Input pin to the internal oscillator from the crystal or a clock. 


HI 


1 





External H1 clock. This clock has a period equal to twice 
CLKIN. 


H3 


1 





External H3 clock. This clock has a period equal to twice 
CLKIN. 




t Input, Output, High-impedance state. 
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Table 2-2. TMS320C30 Signal Descriptions (Concluded) 




SIGNAL 


#PINS 


i/o/zt 


DESCRIPTION 


RESERVED (18 PINS) 


EMU(0-2) 


3 


1 


Reserved. Use pull-ups to +5 volts. See Section 13.5 


EMU3 


1 





Reserved. See Section 1 3.5 


EMU4 


1 


1 


Reserved. Tie to +5 volts. 


EIVIU(5,6) 


2 


NC 


Reserved. 


RSV(O-IO) 


11 


1 


Reserved. Tie to +5 volts. 



t Input, Output, High-impedance state. 

The user must follow the connections specified for the reserved pins. All pull-up resistors must be 20 k 
ohms. All +5 volt supply pins must be connected to a common supply plane and all ground pins must 
be connected to a common ground plane. 
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Section 3 

Architectural Overview 



Emphasis on hardware and software system solutions to demanding arithmetic 
algorithms has resulted in the TMS320C30 architecture shown in Figure 3-1. 
High system performance is achieved through the accuracy and precision of 
the floating-point units, large on-chip memory, a high degree of parallelism, 
and the DMA controller. 

This section provides an architectural overview of the TMS320C30 processor. 
Major areas of discussion are listed below. 

O Central Processing Unit (CPU) (Section 3.1 on page 3-3) 

- Floating-point/integer multiplier 

- ALU for floating-point, integer, and logical operations 

- Auxiliary register arithmetic units (ARAUs) 
CPU register file 

© Memory Organization (Section 3.2 on page 3-7) 
RAM, ROM, and cache 

- Memory maps 

- Memory addressing modes 

- Instruction set summary 

O Internal Bus Operation (Section 3.3 on page 3-18) 
O External Bus Operation (Section 3.4 on page 3-19) 

O Peripherals (Section 3.5 on page 3-20) 

- Timers 

- Serial ports 

O Direct Memory Access (DMA) (Section 3.6 on page 3-21 ) 



3-1 



Architectural Overview 





Figure 3-1. TMS320C30 Block Diagram 
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3.1 Central Processing Unit (CPU) 

The TMS320C30 has a register-based CPU architecture. The CPU consists 
of the following components: 

© Floating-point/integer multiplier 

O ALU for performing arithmetic (floating-point, integer)and logical oper- 
ations 

© 32-bit barrel shifter 

© Internal buses (CPU1 /CPU2 and REG1 /REG2) 

© Auxiliary register arithmetic units (ARAUs) 

© CPU register file. 

Figure 3-2 shows the various CPU components that are discussed in the 
succeeding subsections. 
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Figure 3-2. Central Processing Unit (CPU) 
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3.1.1 Multiplier 

The multiplier performs single-cycle multiplications on 24-bit integer and 
32-bit floating-point values. The TMS320C30 implementation of floating- 
point arithmetic allows for floating-point operations at fixed-point speeds via 
a 60-ns instruction cycle and a high degree of parallelism. To gain even higher 
throughput, a multiply and ALU operation can be performed in a single cycle 
by using parallel instructions. 

When performing floating-point multiplication, the inputs are 32-bit float- 
ing-point numbers, and the result is a 40-bit floating-point number. When 
performing integer multiplication, the input data is 24 bits and yields a 32-bit 
result. Refer to Section 5 for detailed information on data formats and float- 
ing-point operation. 

3.1.2 Arithmetic Logic Unit (ALU) 

The ALU performs single-cycle operations on 32-bit integer, 32-bit logical, 
and 40-bit floating-point data, including single-cycle integer and floating- 
point conversions. Results of the ALU are always maintained in 32-bit integer 
or 40-bit floating-point formats. The barrel shifter is used to shift up to 32 
bits left or right in a single cycle. 

Internal buses, CPU1/CPU2 and REG1/REG2, carry two operands from me- 
mory and two operands from the register file, thus allowing parallel multiplies 
and adds/subtracts on four integer or floating-point operands in a single cycle. 

3.1.3 Auxiliary Register Arithmetic Units (ARAUs) 

Two auxiliary register arithmetic units (ARAUO and ARAU1 ) can generate two 
addresses in a single cycle. The ARAUs operate in parallel with the multiplier 
and ALU. They support addressing with displacements, index registers (IRO 
and IR1), and circular and bit-reversed addressing. Refer to Section 6 for a 
description of addressing modes. 

3.1.4 CPU Register File 

The TMS320C30 provides 28 registers in a multiport register file that is tightly 
coupled to the CPU. All of these registers can be operated upon by the mul- 
tiplier and ALU, and can be used as general-purpose registers. However, the 
registers also have some special functions for which they are more suited than 
others. For example, the eight extended-precision registers are especially 
suited for maintaining extended-precision floating-point results. The eight 
auxiliary registers support a variety of indirect addressing modes and can be 
used as general-purpose 32-bit integer and logical registers. The remaining 
registers provide system functions such as addressing, stack management, 
processor status, interrupts, and block repeat. Refer to Section 6 for detailed 
information and examples of stack management and register usage. 

The registers names and assigned functions are listed in Table 3-1. Following 
the table, the function of each register or group of registers will be briefly de- 
scribed. Refer to Section 4 for detailed information on each of the CPU reg- 
isters. 
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Table 3-1 . CPU Registers 



REGISTER 
NAME 


ASSIGNED FUNCTION 


RO 
R1 
R2 
R3 
R4 
R5 
R6 
R7 


Extended-precision register 
Extended-precision register 1 
Extended-precision register 2 
Extended-precision register 3 
Extended-precision register 4 
Extended -precision register 5 
Extended-precision register 6 
Extended-precision register 7 


ARO 
AR1 
AR2 
AR3 
AR4 
AR5 
AR6 
AR7 


Auxiliary register 
Auxiliary register 1 
Auxiliary register 2 
Auxiliary register 3 
Auxiliary register 4 
Auxiliary register 5 
Auxiliary register 6 
Auxiliary register 7 


DP 
IRO 
IR1 
BK 
SP 


Data page pointer 
Index register 
Index register 1 
Block size 
System stack pointer 


ST 
IE 
IF 
lOF 


Status register 
CPU/DMA interrupt enable 
CPU interrupt flags 
I/O flags 


RS 
RE 
RC 


Repeat start address 
Repeat end address 
Repeat counter 


PC 


Program Counter 



The extended-precision registers (R0-R7) are capable of storing and 
supporting operations on 32-blt integer and 40-bit floating-point numbers. 
Any instruction that assunnes the operands are floating-point numbers uses 
bits 39-0. If the operands are either signed or unsigned integers, only bits 
31 -0 are used, bits 39-32 remain unchanged. This is true for all shift oper- 
ations. Refer to Section 4 for extended-precision register formats for float- 
ing-point and integer numbers. 

The 32-bit auxiliary registers (AR0-AR7) can be accessed by the CPU and 
modified by the two Auxiliary Register Arithmetic Units (ARAUs). The primary 
function of the auxiliary registers is the generation of 24-bit addresses. They 
can also be used to perform a variety of functions, such as loop counters or 
as 32-bit general-purpose registers that can be modified by the multiplier and 
ALU. Refer to Section 6 for detailed information and examples of the use of 
auxiliary registers in addressing. 

The data page pointer (DP) is a 32-bit register. The eight LSBs of the data 
page pointer are used by the direct addressing mode as a pointer to the page 
of data being addressed. Data pages are 64 k words long with a total of 256 
pages. 
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The 32-bit index registers (IRO and IR1 ) are used by the Auxiliary Register 
Arithmetic Unit (ARAL)) for indexing the address. Refer to Section 6 for ex- 
amples of the use of index registers in addressing. 

The 32-bit block size register (BK) is used by the ARAU in circular ad- 
dressing to specify the data block size. 

The system stack pointer (SP) is a 32-bit register that contains the address 
of the top of the system stack. The SP always points to the last element 
pushed onto the stack. A push performs a preincrement and a pop, a postde- 
crement of the system stack pointer. The SP is manipulated by interrupts, 
traps, calls, returns, and the PUSH and POP instructions. Refer to Section 6.5 
for information about system stack management. 

The status register (ST) contains global information relating to the state 
of the CPU. Typically, operations set the condition flags of the status register 
according to whether the result is zero, negative, etc. This includes register 
load and store operations as well as arithmetic and logical functions. When 
the status register is loaded, however, a bit-for-bit replacement is performed 
on the current contents with the contents of the source operand regardless of 
the state of any bits in the source operand. Therefore, following a load, the 
contents of the status register are identically equal to the contents of the 
source operand. This allows the status register to be easily saved and restored. 
See Table 4.2 for a list and definitions of the status register bits. 

The CPU/DMA interrupt enable register (IE) is a 32-bit register. The 
CPU interrupt enable bits are in locations 10-0. The DMA interrupt enable 
bits are in locations 26-16. A 1 in a CPU/DMA interrupt enable register bit 
enables the corresponding interrupt. A disables the corresponding interrupt. 
Refer to Section 4.1 for bit definitions. 

The CPU interrupt flag register (IF) is also a 32-bit register (see Section 
4.1 ). A 1 in a CPU interrupt flag register bit indicates that the corresponding 
interrupt is set. A indicates that the corresponding interrupt is not set. 

The I/O flags register (lOF) controls the function of the dedicated external 
pins, XFO and XF1. These pins may be configured for input or output, and 
they may also be read from and written to. See Section 4.1 for detailed infor- 
mation. 

The repeat counter (RC) is a 32-bit register used to specify the number of 
times a block of code is to be repeated when performing a block repeat. When 
operating in the repeat mode, the 32-bit repeat start address register 
(RS) contains the starting address of the block of program memory to be re- 
peated and the 32-bit repeat end address register (RE) contains the 
ending address of the block to be repeated. 

The program counter (PC) is a 32-bit register containing the address of the 
next instruction to be fetched. Although the PC is not part of the CPU register 
file, it is a register that can be modified by instructions that modify the program 
flow. 
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3.2 Memory Organization 




The total memory space of the TMS320C30 is 16M (million) 32-bit words. 
Program, data, and I/O space are contained within this 16M-word address 
space, thus allowing tables, coefficients, program code, or data to be stored 
in either RAM or ROM. In this way, memory usage can be maximized and 
memory space allocated as desired. 



3.2.1 RAM, ROM, and Cache 



Figure 3-3 shows how the memory is organized on the TMS320C30. RAM 
blocks and 1 are each 1 K x 32 bits. The ROM block is 4K x 32 bits. Each 
RAM and ROM block is capable of supporting two accesses in a single cycle. 
The separate program buses, data buses, and DMA buses allow for parallel 
program fetches, data reads and writes, and DMA operations. For example: 
the CPU can access two data values in one RAM block and perform an ex- 
ternal program fetch in parallel with the DMA loading another RAM block, all 
within a single cycle. 

A 64 X 32- bit instruction cache is provided to store often repeated sections 
of code, thus greatly reducing the number of off-chip accesses necessary. This 
allows for code to be stored off-chip in slower, lower-cost memories. The 
external buses are also freed for use by the DMA, external memory fetches, or 
other devices in the system. 

Refer to Section 4 for detailed information about the memory and instruction 
cache. 
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3.2.2 Memory Maps 




The memory map is dependent upon whether the processor is running in the 
microprocessor mode (MC/MP = 0) or the microcomputer mode (MC/MP = 
1). The memory maps for these modes are very similar (see Figure 3-4). Lo- 
cations SOOOOOh through 801 FFFh are mapped to the expansion bus. When 
this region is accessed, MSTRB is active. Locations 802000h through 
BOSFFFh are reserved. Locations 804000h through 805FFFh are mapped to 
the expansion bus. When this region is accessed, lOSTRB is active. Locations 
806000h through 807FFFh are reserved. All of the memory-mapped periph- 
eral registers are in locations 808000h through 8097FFh. In both modes, 
RAM block is located at addresses 809800h through 809BFFh, and RAM 
block 1 is located at addresses 809C00h through 809FFFh. Locations 
80A000h through OFFFFFFh are accessed over the external memory port 
(STRB active). 

In microprocessor mode, the 4K on-chip ROM is not mapped into the 
TMS320C30 memory map. Locations Oh through 3Fh consist of interrupt 
vector, trap vector, and reserved locations, all of which are accessed over the 
external memory port (STRB active). Locations 40h through 7FFFFFh are also 
accessed over the external memory port. 

In microcomputer mode, the 4K on-chip ROM is mapped into locations Oh 
through OFFFh. There are 192 locations (Oh through BFh) within this block 
for interrupt vectors, trap vectors, and a reserved space. Locations lOOOh 
through 7FFFFFh are accessed over the external memory port (STRB active). 

Section 4.2 describes the memory maps in greater detail. The peripheral bus 
map and the vector locations for reset, interrupts, and traps are also given. 
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3.2.3 Memory Addressing Modes 

The TMS320C30 supports a base set of general-purpose instructions as well 
as arithmetic-intensive instructions that are particularly suited for digital signal 
processing and other numeric-intensive applications. Refer to Section 6 for 
detailed information on addressing. 

Five groups of addressing modes are provided on the TMS320C30. Six types 
of addressing may be used within the groups, as shown in the following list: 

• General addressing modes: 

- Register. The operand is a CPU register. 

- Short immediate. The operand is a 16-bit immediate value. 

- Direct. The operand is the contents of a 24-bit address. 

- Indirect. An auxiliary register indicates the address of the operand. 

^ • Three-operand addressing modes: 

- Register. Same as for general addressing mode. 

- Indirect. Same as for general addressing mode. 

• Parallel addressing modes: 

- Register. The operand is an extended-precision register. 

- Indirect. Same as for general addressing mode. 

• Long-immediate addressing mode. 

- Long immediate. The operand is a 24-bit immediate value. 

• Conditional branch addressing modes: 

- Register. Same as for general addressing mode. 

- PC-relative. A signed 1 6-bit displacement is added to the PC. 

3.2.4 Instruction Set Summary 

Table 3-2 lists the TMS320C30 instruction set in alphabetical order. Each 
table entry shows the instruction mnemonic, description, and operation. Refer 
to Section 11 for a functional listing of the instructions and individual in- 
struction descriptions. 
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Table 3-2. Instruction Set Summary 



MNEMONIC 


DESCRIPTION 


OPERATION 


ABSF 


Absolute value of a floating-point 
number 


|src| -> Rn 


ABSI 


Absolute value of an integer 


|src| -* Dreg 


ADDC 


Add integers with carry 


src + Dreg + C -» Dreg 


ADDC3 


Add integers with carry (3-operand) 


srcl + src2 + C -+ Dreg 


ADDF 


Add floating-point values 


src + Rn -♦ Rn 


ADDF3 


Add floating-point values (3-operand) 


srcl + src2 -> Rn 


ADDI 


Add integers 


src + Dreg -♦ Dreg 


ADDI3 


Add integers (3-operand) 


src1 + src2 + -♦ Dreg 


AND 


Bitwise logical- AND 


Dreg AND src -» Dreg 


AND3 


Bitwise logical-AND (3-operand) 


srcl ANDsrc2 -* Dreg 


ANDN 


Bitwise logical-AND with complement 


Dreg AND src -* Dreg 


ANDN3 


Bitwise logical-ANDN (3-operand) 


srcl AND src2 -♦ Dreg 


ASH 


Arithmetic shift 


If count > 0: 

(Shift Dreg left by count) -» Dreg 
Else: 

(Shift Dreg right by |count|) -* Dreg 


ASH3 


Arithmetic shift (3-operand) 


If count >. 0: 

(Shift src left by count) -► Dreg 
Else: 

(Shift src right by |count|) -* Dreg 


Bcond 


Branch conditionally (standard) 


If cond = true: 
If Csrc is a register, Csrc -* PC 
If Csrc is a value, Csrc + PC -* PC 

Else, PC + 1 -» PC 


BcondD 


Branch conditionally (delayed) 


If cond = true: 
If Csrc is a register, Csrc -* PC 
If Csrc is a value, Csrc + PC + 3 -♦ PC 

Else, PC + 1 -> PC 




LEGEND: 

src - general addressing modes 

srcl - three-operand addressing modes 

src2 - three-operand addressing modes 

Csrc - conditional-branch addressing modes 

Sreg - register address (any register) 

count - shift value (general addressing modes) 

SP - stack pointer 

GIE - global interrupt enable register 

RM - repeat mode bit 

TOS - top of stack 



Dreg - register address (any register) 

Rn - register address (R0-R7) 

Daddr - destination memory address 

ARn - auxiliary register n (AR0-AR7) 

addr - 24-bit immediate address (label) 

cond - condition code (see Section 1 1 ) 

ST - status register 

RE - repeat interrupt register 

RS - repeat start register 

PC - program counter 
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Table 3-2. Instruction Set Summary (Continued) 




MNEMONIC 


DESCRIPTION 


OPERATION 


BR 


Branch unconditionally (standard) 


Value -» PC 


BRD 


Branch unconditionally (delayed) 


Value -*■ PC 


CALL 


Call subroutine 


PC + 1 -> TOS 
Value -► PC 


CALLcond 


Call subroutine conditionally 


If cond = true: 
PC + 1 -♦ TOS 

If Csrc is a register, Csrc -♦ PC 
If Csrc is a value, Csrc + PC -» PC 

Else, PC + 1 -> PC 


CMPF 


Compare floating-point values 


Set flags on Rn - src 


CMPF3 


Compare floating-point values 
(3-operand) 


Set flags on srcl - src2 


CMPI 


Compare integers 


Set flags on Dreg - src 


CIVIPI3 


Compare integers (3-operand) 


Set flags on srcl - src2 


DBcond 


Decrement and branch conditionally 
(standard) 


ARn - 1 -♦ ARn 

If cond = true and ARn >_ 0: 
If Csrc is a register, Csrc -♦ PC 
If Csrc is a value, Csrc + PC -» PC 

Else, PC + 1 -* PC 


DBcondD 


Decrement and branch conditionally 
(delayed) 


ARn - 1 -♦ ARn 

If cond = true and ARn >. 0: 
If Csrc is a register, Csrc -♦ PC 
If Csrc is a value, Csrc + PC + 3 -► PC 

Else, PC + 1 -* PC 


FIX 


Convert floating-point value to integer 


Fix (src) -♦ Dreg 


FLOAT 


Convert integer to floating-point value 


Float(src) -♦ Rn 


IDLE 


Idle until interrupt 


PC + 1 -♦PC 

Idle until next interrupt 


LDE 


Load floating-point exponent 


src(exponent) -* Rn (exponent) 


LDF 


Load floating-point value 


src -♦ Rn 



LEGEND: 

src - general addressing modes 

srcl - three-operand addressing modes 

src2 - three-operand addressing modes 

Csrc - conditional-branch addressing modes 

Sreg - register address (any register) 

count - shift value (general addressing modes) 

SP - stack pointer 

GIE - global interrupt enable register 

RM - repeat mode bit 

TOS - top of stack 



Dreg - register address (any register) 

Rn - register address (R0-R7) 

Daddr - destination memory address 

ARn - auxiliary register n (AR0-AR7) 

addr - 24-bit immediate address (label) 

cond - condition code (see Section 1 1 ) 

ST - status register 

RE - repeat interrupt register 

RS - repeat start register 

PC - program counter 
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Table 3-2. Instruction Set Summary (Continued) 



MNEMONIC 


DESCRIPTION 


OPERATION 


LDFcond 


Load floating-point value 
conditionally 


If cond = true, src -* Rn 
Else, Rn is not changed 


LDFI 


Load floating-point value, 
interlocked 


Signal interlocked operation 
src -♦ Rn 


LDI 


Load integer 


src -» Dreg 


LDlconcf 


Load integer conditionally 


If cond = true, src -» Dreg 
Else, Dreg is not changed 


LDII 


Load integer, interlocked 


Signal interlocked operation 
src -♦ Dreg 


LDM 


Load floating-point mantissa 


src(mantissa) -+ Rn(mantissa) 


LSH 


Logical shift 


If count >. 0: 

(Dreg left-shifted by count) -♦ Dreg 
Else: 

(Dreg right-shifted by |count|) -♦ Dreg 


LSH3 


Logical shift (3-operand) 


If count >.0: 

(src left-shifted by count) -* Dreg 
Else: 

(src right-shifted by (count|) -* Dreg 


MPYF 


Multiply floating-point values 


src X Rn -» Rn 


MPYF3 


Multiply floating-point values 
(3-operand) 


srcl X src2 -> Rn 


MPYI 


Multiply integers 


src X Dreg -> Dreg 


MPYI3 


Multiply integers (3-operand) 


srcl X src2 -♦ Dreg 


NEGB 


Negate integer with borrow 


- src - C -» Dreg 


NEGF 


Negate floating-point value 


- src -♦ Rn 


NEGI 


Negate integer 


- src -♦ Dreg 


NOP 


No operation 


Modify src if specified 


NORM 


Normalize floating-point value 


Normalize (src) -♦Rn 


NOT 


Bitwise logical-complement 


src -* Dreg 


OR 


Bitwise logical-OR 


Dreg OR src -* Dreg 


0R3 


Bitwise logical-OR (3-operand) 


src1 OR src2 -* Dreg 




LEGEND: 

src - general addressing modes 

srcl - three-operand addressing modes 

src2 - three-operand addressing modes 

Csrc - conditional-branch addressing modes 

Sreg - register address (any register) 

count - shift value (general addressing modes) 

SP - stack pointer 

GIE - global interrupt enable register 

RM - repeat mode bit 

TOS - top of stack 



Dreg - register address (any register) 

Rn - register address (R0-R7) 

Daddr - destination memory address 

ARn - auxiliary register n (AR0-AR7) 

addr - 24-bit immediate address (label) 

cond - condition code (see Section 1 1 ) 

ST - status register 

RE - repeat interrupt register 

RS - repeat start register 

PC - program counter 
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Table 3-2. Instruction Set Summary (Continued) 




MNEMONIC 


DESCRIPTION 


OPERATION 


POP 


Pop integer from stack 


*SP---* Dreg 


POPF 


Pop floating-point value from staci< 


•SP— ► Rn 


PUSH 


Push integer on stack 


Sreg -♦*-i-+ SP 


PUSHF 


Push floating-point value on stack 


Rn -+ * + + SP 


RETl cone/ 


Return from interrupt conditionally 


If cond = true or missing: 

•SP"-» PC 

1 -* ST (GIE) 
Else, continue 


HETScond 


Return from subroutine conditionally 


If cond = true or missing: 

*SP- - PC 
Else, continue 


RND 


Round floating-point value 


Round (src) -* Rn 


ROL 


Rotate left 


Dreg rotated left 1 bit -+ Dreg 


ROLC 


Rotate left through carry 


Dreg rotated left 1 bit through carry -♦ Dreg 


ROR 


Rotate right 


Dreg rotated right 1 bit -* Dreg 


RORC 


Rotate right through carry 


Dreg rotated right 1 bit thru carry -♦ Dreg 


RPTB 


Repeat block of instructions 


src -*■ RE 
1 -> ST (RM) 
Next PC -♦ RS 


RPTS 


Repeat single instruction 


src -♦ RC 
1 -» ST (RM) 
Next PC -> RS 
Next PC -» RE 


SIGI 


Signal, interlocked 


Signal interlocked operation 
Wait for interlock acknowledge 
Clear interlock 


STF 


Store floating-point value 


Rn -* Daddr 


STFl 


Store floating-point value, interlocked 


Rn -* Daddr 

Signal end of interlocked operation 


STI 


Store integer 


Sreg -» Daddr 


STII 


Store integer, interlocked 


Sreg -* Daddr 

Signal end of interlocked operation 



LEGEND: 

src - general addressing modes 

srcl - three-operand addressing modes 

src2 - three-operand addressing modes 

Csrc - conditional-branch addressing modes 

Sreg - register address (any register) 

count - shift value (general addressing modes) 

SP - stack pointer 

GIE - global interrupt enable register 

RM - repeat mode bit 

TOS - top of stack 



Dreg - register address (any register) 

Rn - register address (R0-R7) 

Daddr - destination memory address 

ARn - auxiliary register n (AR0-AR7) 

addr - 24-bit immediate address (label) 

cond - condition code (see Section 1 1 ) 

ST - status register 

RE - repeat interrupt register 

RS - repeat start register 

PC - program counter 
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Table 3-2. Instruction Set Summary (Continued) 



MNEMONIC 


DESCRIPTION 


OPERATION 


SUBB 


Subtract integers with borrow 


Dreg - src - C -+ Dreg 


SUBB3 


Subtract integers with borrow 
(3-operand) 


srcl - src2 - C -♦ Dreg 


SUBC 


Subtract integers conditionally 


If Dreg - src ^ 0: 

[(Dreg-src) << 1] OR 1 -♦ Dreg 
Else, Dreg << 1 -► Dreg 


SUBF 


Subtract floating-point values 


Rn - src -*■ Rn 


SUBF3 


Subtract floating-point values 
(3-operand) 


srcl - src2 -♦ Rn 


SUBI 


Subtract integers 


Dreg - src -*■ Dreg 


SUBI3 


Subtract integers (3-operand) 


srcl - src2 -* Dreg 


SUBRB 


Subtract reverse integer with borrow 


src - Dreg - C -♦ Dreg 


SUBRF 


Subtract reverse floating-point value 


src - Rn -* Rn 


SUBRI 


Subtract reverse integer 


src - Dreg -* Dreg 


SWI 


Software interrupt 


Perform emulator interrupt sequence 


TRAPcond 


Trap conditionally 


If cond = true or missing: 
Next PC - • ++ SP 
Trap vector N -♦ PC 
- ST (GIE) 

Else, continue 


TSTB 


Test bit fields 


Dreg AND src 


TSTB3 


Test bit fields (3-operand) 


srcl AND src2 


XOR 


Bitwise exclusive-OR 


Dreg XOR src -♦ Dreg 


X0R3 


Bitwise exclusive-OR (3-operand) 


srcl XOR src2 -♦ Dreg 




LEGEND: 

src - general addressing modes 

srcl - three-operand addressing modes 

src2 - three-operand addressing modes 

Csrc -conditional-branch addressing modes 

Sreg - register address (any register) 

count - shift value (general addressing modes) 

SP - stack pointer 

GIE - global interrupt enable register 

RM - repeat mode bit 

TOS - top of stack 



Dreg - register address (any register) 

Rn - register address (R0-R7) 

Daddr - destination memory address 

ARn - auxiliary register n (AR0-AR7) 

addr - 24-bit immediate address (label) 

cond - condition code (see Section 11) 

ST - status register 

RE - repeat interrupt register 

RS - repeat start register 

PC - program counter 
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Table 3-2. Instruction Set Summary (Continued) 



MNEMONIC 


DESCRIPTION 


OPERATION 


PARALLEL ARITHMETIC VVITH STORE INSTRUCTIONS 


ABSF 

II STF 


Absolute value of a floating-point 


lsrc2| ->■ dsti 
II src3 -* dst2 


ABSI 
IISTI 


Absolute value of an integer 


|src2| -» dstl 
1 1 src3 -♦ dst2 


ADDF3 
II STF 


Add floating-point 


srcl + src2 -♦ dstl 
1 1 src3 -♦ dst2 


ADDIS 

II STI 


Add integer 


srcl + src2 -* dst1 
1 1 src3 -* dst2 


AND3 
II STI 


Bitwise logical- AND 


srcl AND src2 -♦ dstl 
1 1 src3 -* dst2 


ASH3 
11 STI 


Arithmetic shift 


If count >. 0: 

src2 << count -♦ dst1 
11 src3 ->■ dst2 
Else: 

src2 >> IcountI -♦ dst1 
1 1 src3 -► dst2 


FIX 

II STI 


Convert floating-point to integer 


Fix(src2) -♦ dstl 
1 1 src3 -> dst2 


FLOAT 

II STF 


Convert integer to floating-point 


Float(src2) ->■ dstl 
11 src3 -*■ dst2 


LDF 

11 STF 


Load floating-point 


src2 -» dstl 
11 src3 -»dst2 


LDI 

11 STI 


Load integer 


src2 -♦ dstl 
11 src3 -* dst2 


LSH3 
11 STI 


Logical shift 


If count >. 0: 

src2 << count -♦ dstl 
II src3 -♦ dst2 
Else: 

src2 >> IcountI -*■ dstl 
II src3 -» dst2 


MPYF3 
II STF 


Multiply floating-point 


srcl X src2 -* dstl 
11 src3 -♦ dst2 


MPYI3 
11 STI 


Multiply integer 


srcl X src2 -♦ dstl 
11 src3 -► dst2 


NEGF 

11 STF 


Negate floating-point 


0- src2 -* dst1 
II src3 -» dst2 



LEGEND: 

srcl - register addr (R0-R7) 
srcS - register addr (R0-R7) 
dstl - register addr (R0-R7) 



src2- indirect addr (disp = 0, 1, IRO, IR1) 
src4 - indirect addr (disp = 0, 1, IRO, IR1) 
dst2- indirect addr (disp = 0, 1, IRO, IR1) 



3-18 



Architectural Overvie\A/ - Memory Organization 



Table 3-2. Instruction Set Summary (Concluded) 



MNEMONIC 


DESCRIPTION OPERATION 


PARALLEL ARITHMETIC WITH STORE INSTRUCTIONS (Concluded) 


NEGI 
II STI 


Negate integer 


- src2 -♦ dsti 
1 1 src3 -» dst2 


N0T3 
II STI 


Complement 


srcl -» dsti 
1 1 src3 -♦ dst2 


0R3 

II STI 


Bitwise logical-OR 


srcl OR src2 -> dsti 
||src3 -* dst2 


STF 

II STF 


Store floating-point 


srcl -> dsti 
||src3 -+ dst2 


STI 

II STI 


Store integer 


srcl -♦ dsti 
1 1 src3 -»• dst2 


SUBF3 

II STF 


Subtract floating-point 


src1 - src2 -► dsti 
II src3 -►dst2 


SUBI3 
II STI 


Subtract integer 


srcl - src2 -*■ dsti 
II src3 -♦ dst2 


X0R3 

II STI 


Bitwise exclusive-OR 


srcl XOR src2 -♦ dsti 
II src3 -* dst2 


PARALLEL LOAD INSTRUCTIONS 


LDF 

II LDF 


Load floating-point 


src2 ->• dsti 
1 1 src4 -♦ dst2 


LDI 

II LDI 


Load integer 


src2 -♦ dsti 
II src4 -•• dst2 


PARALLEL MULTIPLY AND ADD/SUBTRACT INSTRUCTIONS 


MPYF3 
II ADDF3 


Multiply and add floating-point 


opi X op2 -♦ op3 

II op4 + op5 -* op6 


MPYF3 
II SUBF3 


Multiply and subtract floating-point 


op1 X op2 -» op3 

II op4 - op5 -* op6 


MPYI3 
II ADDI3 


Multiply and add integer 


opi X op2 -* op3 

II op4 + op5 ->■ op6 


MPYI3 
II SUBI3 


Multiply and subtract integer 


opi X op2 -» op3 

II op4 - op5 -* op6 




LEGEND: 

srcl - register addr (R0-R7) src2 - indirect addr (disp = 0, 1, IRO, IR1) 

srcS -register addr (R0-R7) src4- indirect addr (disp = 0, 1, IRO, IR1) 

dsti - register addr (R0-R7) dst2- indirect addr (disp = 0, 1, IRO, IR1) 

op3 - register addr (RO or R1) op6 - register addr (R2 or R3) 

op1,op2,op4,op5 - Two of these operands must be specified using register addr. 
and two must be specified using indirect 
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3.3 Internal Bus Operation 




A large portion of the TMS320C30's high performance is due to the internal 
busing and the parallelism possible because of this busing. The separate 
program buses (PADDR and PDATA), data buses (DADDRl, DADDR2, and 
DDATA), and DMA buses (DMAADDR and DMADATA) allow for parallel 
program fetches, data accesses, and DMA accesses. These buses connect all 
of the physical spaces (on-chip memory, off-chip memory, and on-chip pe- 
ripherals) supported by the TMS320C30. 

The program counter (PC) is connected to the 24-bit program address bus 
(PADDR). The instruction register (IR) is connected to the 32-bit program 
data bus (PDATA). These buses can fetch a single instruction word every 
machine cycle. 

The 24-bit data address buses (DADDRl and DADDR2) and the 32-bit data 
data bus (DDATA) support two data memory accesses every machine cycle. 
The DDATA bus carries data to the CPU over the CPU1 and CPU2 buses. The 
CPU1 and CPU2 buses can carry two data memory operands to the multiplier, 
ALU, and register file every machine cycle. Also internal to the CPU are reg- 
ister buses REG1 and REG2 that can carry two data values from the register 
file to the multiplier and ALU every machine cycle. 

The DMA controller is supported with a 24-bit address bus (DMAADDR) and 
a 32-bit data bus (DMADATA). These buses allow the DMA to perform me- 
mory accesses in parallel with the memory accesses occurring from the data 
and program buses. 
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3.4 External Bus Operation 



The TMS320C30 provides two external interfaces: the primary bus and ex- 
pansion bus. Both consist of a 32-bit data bus and a set of control signals. 
The primary bus has a 24-bit address bus, whereas the expansion bus has a 
1 3-bit address bus. Both buses can be used to address e xtern al program/data 
memory or I/O space. The buses also have an external RDY signal for wait- 
state generation. Additional wait states may be inserted under software con- 
trol. Refer to Section 8 for detailed information on external bus operation. 

The TMS320C30 supports four external interr upts ( iN73-TrTfO), a number of 
internal interrupts, and a nonmaskable external RESET signal. Two external I/O 
flags, XFO and XF1, can be configured as input or output pins under software 
control. These pins are also used by the interlocked operations of the 
TMS320C30. The interlocked-operations instruction group supports multi- 
processor communication (see Section 7 for examples of the use of inter- 
locked instructions). 
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3.5 Peripherals 




All TMS320C30 peripherals are controlled through memory mapped registers 
on a dedicated peripheral bus, composed of a 32-bit data bus and a 24-bit 
address bus. This peripheral bus permits straightforward communication to 
the peripherals. The TI\/IS320C30 peripherals include two timers and two se- 
rial ports. Figure 3-5 shows the peripherals with associated buses and signals. 
Refer to Section 9 for detailed information on the peripherals. 



M S 

E P 

M A 

O C 

R E 
Y 



O 



P 

E 
R 

P 

'h; 

.E. 

R. 
■ A. 

L- 

D 
A 

'T' 
A. 

B. 
U. 
S 



): 



SERIAL PORT 



PORT 
CONTROL REGISTER 



R/X TIMER 
REGISTER 



DATA TRANSMIT 
REGISTER 



DATA RECEIVE 
REGISTER 



TIMER 



GLOBAL 
CONTROL REGISTER 



TIMER PERIOD 
REGISTER 



TIMER COUNTER 
REGISTER 




SERIAL PORT 1 






PORT 
CONTROL REGISTER 








R/X TIMER 
REGISTER 






DATA TRANSMIT 
REGISTER 






DATA RECEIVE 
REGISTER 







FSX1 

DX1 

CLKX1 

FSR1 

DR1 

CLKR1 



^ l» TCLKO 



TIMER 1 






GLOBAL 
CONTROL REGISTER 






TIMER PERIOD 
REGISTER 




TIMER COUNTER 
REGISTER 





■* » TCLK1 



Figure 3-5. Peripheral Modules 
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3.5.1 Timers 



The two timer modules are general-purpose 32-bit timer/event counters, with 
two signaling modes and internal or external clocking. Each timer has an I/O 
pin that can be used as an input clock to the timer or as an output signal dri- 
ven by the timer. The pin may also be configured as a general-purpose I/O 
pin. 



3.5.2 Serial Ports 



The two serial ports are totally independent. They are identical with a com- 
plementary set of control registers controlling each one. Each serial port can 
be configured to transfer 8, 1 6, 24, or 32 bits of data per word. The clock for 
each serial port can originate either internally or externally. An internally 
generated divide-down clock is provided. The serial port pins are configurable 
as general-purpose I/O pins. The serial ports can also be configured as timers. 
A special handshake mode allows TMS320C30s to communicate over their 
serial ports with guaranteed synchronization. 
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3.6 Direct Memory Access (DMA) 

The on-chip Direct IVIemory Access (DMA) controller can read from or write 
to any location in the memory map without interfering with the operation of 
the CPU. Therefore, the TMS320C30 can interface to slow external memories 
and peripherals without reducing throughput to the CPU. The DMA controller 
contains its own address generators, source and destination registers, and 
transfer counter. Dedicated DMA address and data buses allow for minimi- 
zation of conflicts between the CPU and the DMA controller. A DMA opera- 
tion consists of a block or single-word transfer to or from memory. Refer to 
Section 9 for detailed information on the DMA. Figure 3-6 shows the DMA 
controller with associated buses. 
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Figure 3-6. DMA Controller 



In summary, the TMS320C30 is a powerful DSP system because of its inte- 
gration of a powerful CPU, large memories, and sufficient buses to support its 
speed. These along with peripherals such as a DMA controller, two serial 
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ports, and two timers are all contained on a single chip. The total system real 
estate and price have been reduced, providing the user with a true single-chip 
solution. 
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Section 4 

CPU Registers, R/iemory, and Cache 



The CPU register file contains 28 registers that can be operated upon by the 
multiplier and ALU (arithmetic logic unit). Included in the register file are the 
auxiliary registers, extended-precision registers, and index registers. The reg- 
isters in the CPU register file support addressing, floating-point/integer oper- 
ations, stack management, processor status, block repeats, and interrupts. 

The TMS320C30 provides a total memory space of 16M (million) 32-bit 
words. Program, data, and I/O space are contained within this 16l\/l-word 
address space. Two RAM blocks of 1 K x 32 bits each and a ROM block of 4K 
x 32 bits permit two accesses in a single cycle. The memory maps for the 
microcomputer and microprocessor modes are similar, except that the on-chip 
ROM is not used in microprocessor mode. 

A 64 x 32-bit instruction cache stores often repeated sections of code. This 
greatly reduces the number of off-chip accesses necessary and allows code to 
be stored off-chip in slower, lower-cost memories. Three bits are provided in 
the CPU status register to control the clear, enable, or freeze of the cache. 

This section describes in detail each of the CPU registers, the memory maps, 
and the instruction cache. Major topics in this section are as follows: 

• CPU Register File (Section 4.1 on page 4-2) 

- Extended-precision registers (R0-R7) 

- Auxiliary registers (AR0-AR7) 
Index registers (IRO, IR1 ) 

- Block size register (BK) 

- Data page pointer (DP) 
System stack pointer (SP) 

- Status register (ST) 

- CPU/DMA interrupt enable register (IE) 

- CPU interrupt flag register (IF) 
I/O flags register (lOF) 

- Repeat counter (RC) and block repeat registers (RS, RE) 
Program counter (PC) 

• Memory (Section 4.2 on page 4-1 1 ) 

- Memory maps 

- Peripheral bus map 

- Reset/interrupt/trap map 

9 Instruction Cache (Section 4.3 on page 4-15) 

- Cache architecture 

- Cache algorithm 

- Cache control bits 
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4.1 CPU Register File 



The TMS320C30 provides 28 registers in a muitiport register file that is tightly 
coupled to the CPU. The PC is not included in the 28 registers. All of these 
registers can be operated upon by the multiplier and ALU, and can be used 
as general-purpose 32-bit registers. However, the registers also have some 
special functions for which they are more suited than others. For example, the 
eight extended-precision registers are especially suited for maintaining ex- 
tended-precision floating-point results. The eight auxiliary registers support 
a variety of indirect addressing modes and can be used as general-purpose 
32-bit integer and logical registers. The remaining registers provide system 
functions such as addressing, stack management, processor status, interrupts, 
and block repeat. Refer to Section 6 for detailed information and examples 
of the use of CPU registers in addressing. 

The registers names and assigned function are listed in Table 4-1. 

Table 4-1. CPU Registers 



REGISTER 
NAME 


ASSIGNED FUNCTION 


RO 
R1 
R2 
R3 
R4 
R5 
R6 
R7 


Extended-precision register 
Extended-precision register 1 
Extended-precision register 2 
Extended-precision register 3 
Extended- precision register 4 
Extended-precision register 5 
Extended-precision register 6 
Extended-precision register 7 


ARO 
AR1 
AR2 
AR3 
AR4 
AR5 
AR6 
AR7 


Auxiliary register 
Auxiliary register 1 
Auxiliary register 2 
Auxiliary register 3 
Auxiliary register 4 
Auxiliary register 5 
Auxiliary register 6 
Auxiliary register 7 


DP 
IRO 
IR1 
BK 
SP 


Data page pointer 
Index register 
Index register 1 
Block size 
System stack pointer 


ST 

IE 

IF 

lOF 


Status register 
CPU/DMA interrupt enable 
CPU interrupt flags 
I/O flags 


RS 
RE 
RC 


Repeat start address 
Repeat end address 
Repeat counter 


PC 


Program counter 
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4.1.1 Extended-Precision Registers (R0-R7) 

The eight extended-precision registers (R0-R7) are capable of storing and 
supporting operations on 32-bit integer and 40-bit floating-point numbers. 
These registers consist of two separate and distinct regions. Bits 39-32 of the 
extended-precision registers are dedicated to the storage of the exponent (e) 
of the floating-point number. Bits 31 -0 store the mantissa of the floating- 
point number. Bit 31 is the sign (s) bit, bits 30-0 are the fraction (f). Any 
instruction that assumes the operands are floating-point numbers uses bits 
39-0. Figure 4-1 illustrates the storage of 40-bit floating-point numbers in the 
extended-precision registers. 

39 32 31 30 



e s f 



^ 



mantissa 
Figure 4-1. Extended-Precision Register Floating-Point Format 



For integer operations, bits 31-0 of the extended-precision registers contain 
the integer (signed or unsigned). Any instruction that assumes the operands 
are either signed or unsigned integers uses only bits 31 -0. Bits 39-32 remain 
unchanged. This is true for all shift operations. The storage of 32-bit integers 
in the extended-precision registers is shown in Figure 4-2. 

39 32 31 



unchanged 



signed or unsigned integer 



Figure 4-2. Extended-Precision Register Integer Format 



4.1.2 Auxiliary Registers (AR0-AR7) 

The eight 32-bit auxiliary registers (AR0-AR7) can be accassed by the CPU 
and modified by the two Auxiliary Register Arithmetic Units (ARAUs). The 
primary function of the auxiliary registers is the generation of 24-bit addresses. 
However, they can also be used to perform a variety of functions, such as loop 
counters in indirect addressing or as 32-bit general-purpose registers that can 
be modified by the multiplier and ALU. Refer to Section 6 for detailed infor- 
mation and examples of the use of auxiliary registers in addressing. 
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4.1.3 Data Page Pointer (DP) 

The data page pointer (DP) is a 32-bit register. The eight LSBs of the data 
page pointer are used by the direct addressing mode as a pointer to the page 
of data being addressed. Data pages are 64 k words long with a total of 256 
pages. Bits 31 - 8 are reserved and should always be kept zero by the user. 

4.1.4 Index Registers (IRO, IR1) 

The 32-bit index registers (IRO and IR1) are used by the Auxiliary Register 
Arithmetic Unit (ARAL!) for indexing the address. Refer to Section 6 for de- 
tailed information and examples of the use of index registers in addressing. 

4.1.5 Block Size Register (BK) 

The 32-bit block size register (BK) is used by the ARAL) in circular addressing 
to specify the data block size (see Section 6.3). 

4.1 .6 System Stack Pointer (SP) 

The system stack pointer (SP) is a 32-bit register that contains the address of 
the top of the system stack. The SP always points to the last element pushed 
onto the stack. The SP is manipulated by interrupts, traps, calls, returns, and 
the PUSH, PUSHF, POP, and POPF instructions. Pushes and pops of the 
stack perform pre-increment and post-decrement on all 32 bits of the stack 
pointer. However, only the 24 LSBs are used as an address. Refer to Section 
6.5 for information about system stack management. 

4.1 .7 Status Register (ST) 

The status register (ST) contains global information relating to the state of the 
CPU. Typically, operations set the condition flags of the status register ac- 
cording to whether the result is zero, negative, etc. This includes register load 
and store operations as well as arithmetic and logical functions. When the 
status register is loaded, however,a bit-for-bit replacement is performed of the 
current contents with the contents of the source operand regardless of the 
state of any bits in the source operand. Therefore, following a load, the con- 
tents of the status register are identically equal to the contents of the source 
operand. This allows the status register to be easily saved and restored. At 
system reset, is written to this register. 

The format of the status register is shown in Figure 4-3. Table 4-2 defines the 
status register bits, their names and functions. 
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31 
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15 
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4 


3 


2 
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XX 


XX 


GIE 


CC 


CE 


CF 


XX 


RM 


OVM 


LUF 


LV 


UF 


N 


Z 


V 


C 



R/W R/W R/W R/W 



R/W R/W R/W R/W R/W R/W R/W R/W R/W 



NOTE: XX = reserved bit. 

R = read, W = write. 



Figure 4-3. Status Register 
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Table 4-2. Status Register Bits Summary 



BIT 


NAME 


FUNCTION 





C 


Carry flag 


1 


V 


Overflow flag 


2 


Z 


Zero flag 


3 


N 


Negative flag 


4 


UF 


Floating-point underflow flag 


5 


LV 


Latched overflow flag 


6 


LUF 


Latched floating-point underflow flag 


7 


OVM 


Overflow mode flag. This flag affects only the integer operations. If 
OVM = 0, the overflow mode is turned off; integer results that over- 
flow are treated in no special way. If OVM = 1, integer results over- 
flowing in the positive direction are set to the most positive 32-bit 
two's-complement number (7FFFFFFFh). If OVM = 1, integer results 
overflowing in the negative direction are set to the most negative 
32-bit two's-complement number (SOOOOOOOh). Note that the func- 
tion of V and LV is independent of the setting of OVM. 


8 


RM 


Repeat mode flag. If RM = 1, the PC is being modified in either the 
repeat block or repeat-single mode. 


9 


Reserved 


Read as 0. 


10 


CF 


Cache Freeze. When CF = 1, the cache is frozen. If the cache is en- 
abled (CE = 1), fetches from the cache are allowed, but no modifica- 
tion of the state of the cache is performed. This function can be used 
to save frequently used code resident in the cache. At reset, is writ- 
ten to this bit. Cache clearing (CC=1 ) is allowed when CF=0. 


11 


CE 


Cache Enable. CE = 1 enables the cache, allowing the cache to be 
used according to the LRU cache algorithm. CE = disables the 
cache; no update or modification of the cache can be performed. No 
fetches are made from the cache. This function is useful for system 
debug. At system reset, is written to this bit. Cache clearing (CC 
= 1) is allowed when CE=0. 


12 


CC 


Cache Clear. CC = 1 invalidates all entries in the cache. This bit is 
always cleared after it is written to and thus always read as 0. At reset, 
is written to this bit. 


13 


GIE 


Global interrupt enable. If GIE = 1, the CPU responds to an enabled 
interrupt. If GIE = 0, the CPU does not respond to an enabled inter- 
rupt. 


14-15 


Reserved 


Read as 0. 


16-31 


Reserved 


Value undefined. 
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4.1.8 CPU/DMA interrupt Enable Register (IE) 

The CPU/DMA interrupt enable register (IE) is a 32-bit register (see Figure 
4-4). The CPU interrupt enable bits are in locations 10-0. The DMA interrupt 
enable bits are in locations 26-16. A 1 in a CPU/DMA interrupt enable reg- 
ister bit enables the corresponding interrupt. A disables the corresponding 
interrupt. At reset, is written to this register. Table 4-3 defines the register 
bits, the bit names, and the bit functions. 



31 


30 


29 


28 


27 


26 


25 


24 


23 


22 


21 


20 


19 


18 


17 


16 


XX 


XX 


XX 


XX 


XX 


EDINT 
(DMA) 


ETINT1 
(DMA) 


ETINTO 
(DMA) 


ERINT1 
(DMA) 


EXINT1 
(DMA) 


ERINTO 
(DMA) 


EXINTO EINT3 EINT2 EINT1 EINTO 
(DMA) (DMA) (DMA) (DMA) (DMA) 


15 


14 


13 


12 


R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W 
11 10 9 8 7 6 5 43210 


XX 


XX 


XX 


XX 


XX 


EDINT 
(CPU) 


ETINT1 
(CPU) 


ETINTO 
(CPU) 


ERINT1 
(CPU) 


EXINT1 
(CPU) 


ERINTO 
(CPU) 


EXINTO 
(CPU) 


EINT3 
(CPU) 


EINT2 
(CPU) 


EINT1 
(CPU) 


EINTO 
(CPU) 




R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W 



NOTE: XX = reserved bit, read as 0. 
R = read, W = write. 



Figure 4-4. CPU/DMA Interrupt Enable Register (IE) 
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Table 4-3. IE Register Bits Summary 



BIT 


NAME 


FUNCTION 





EINTO 


Enable external interrupt (CPU) 


1 


EINT1 


Enable external interrupt 1 (CPU) 


2 


EINT2 


Enable external interrupt 2 (CPU) 


3 


EINT3 


Enable external interrupt 3 (CPU) 


4 


EXINTO 


Enable serial port transmit interrupt (CPU) 


5 


ERINTO 


Enable serial port receive interrupt (CPU) 


6 


EXINT1 


Enable serial port 1 transmit interrupt (CPU) 


7 


ERINT1 


Enable serial port 1 receive interrupt (CPU) 


8 


ETINTO 


Enable timer interrupt (CPU) 


9 


ETINT1 


Enable timer 1 interrupt (CPU) 


10 


EDINT 


Enable DMA controller interrupt (CPU) 


11-15 


Reserved 


Value undefined 


16 


EINTO 


Enable external interrupt (DMA) 


17 


EINT1 


Enable external interrupt 1 (DMA) 


18 


EINT2 


Enable external interrupt 2 (DMA) 


19 


EINT3 


Enable external interrupt 3 (DMA) 


20 


EXINTO 


Enable serial port transmit interrupt (DMA) 


21 


ERINTO 


Enable serial port receive interrupt (DMA) 


22 


EXINT1 


Enable serial port 1 transmit interrupt (DMA) 


23 


ERINT1 


Enable serial port 1 receive interrupt (DMA) 


24 


ETINTO 


Enable timer interrupt (DMA) 


25 


ETINT1 


Enable timer 1 interrupt (DMA) 


26 


EDINT 


Enable DMA controller interrupt (DMA) 


27-32 


Reserved 


Value undefined 



4.1.9 CPU Interrupt Flag Register (IF) 

The 32-bit CPU interrupt flag register (IF) is shown in Figure 4-5. A 1 In a 
CPU interrupt flag register bit indicates that the corresponding interrupt is set. 
The IF bits are set to 1 when an interrupt occurs. They may also be set to 1 
through software to cause an interrupt. A indicates that the corresponding 
interrupt is not set. If a is written to an interrupt flag register bit, the corre- 
sponding interrupt is cleared. At reset, is written to this register. Table 4-4 
lists the bit fields, bit field names, and bit field functions of the CPU interrupt 
flag register. 
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31 


30 


29 


28 


27 


26 


25 


24 


23 


22 


21 


20 


19 


18 


17 


16 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


15 14 13 12 11 10 9 


8 


7 


6 


5 


4 


3 2 1 





XX 


XX 


XX 


XX 


XX 


DINT TINT1 


TINTO 


RINT1 


XINT1 


RINTO 


XINTO 


INT3 


INT2 


INT1 


INTO 



R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W 



NOTE: XX = reserved bit, read as 0. 
R = read, W = write. 



Figure 4-5. CPU Interrupt Flag Register (IF) 



Table 4-4. IF Register Bits Summary 



BIT 


NAME 


FUNCTION 





INTO 


External interrupt flag 


1 


INT1 


External interrupt 1 flag 


2 


INT2 


External interrupt 2 flag 


3 


INT3 


External interrupt 3 flag 


4 


XINTO 


Serial port transmit interrupt flag 


5 


RINTO 


Serial port receive interrupt flag 


6 


XINT1 


Serial port 1 transmit interrupt flag 


7 


RINT1 


Serial port 1 receive interrupt flag 


8 


TINTO 


Timer interrupt flag 


9 


TINT1 


Timer 1 interrupt flag 


10 


DINTO 


DMA channel interrupt flag 


11-31 


Reserved 


Value undefined 



4.1.10 I/O Flags Register (lOF) 



The I/O flags register (lOF) controls the function of the dedicated external 
pins, XFO and XF1. These pins may be configured for input or output (see 
Table 4-5). They may also be read from and written to. At reset, is written 
to this register. The bit fields, bit field names, and bit field functions are shown 
in Table 4-5. 
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XX 


XX 
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XX 


XX 


XX 


XX 


XX 


15 14 13 12 11 10 9 8 


7 


6 


5 


4 


3 


2 


1 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


INXF1 


0UTXF1 


T/0XF1 


XX 


INXFO 


OUTXFO 


T/OXFO 


XX 



R/W R/W 



R/W R/W 



NOTE: XX = reserved bit, read as 0. 
R = read, W = write. 



Figure 4-6. I/O Flag Register (lOF) 
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Table 4-5. lOF Register Bits Summary 



BIT 


NAME 


FUNCTION 





Reserved 


Read as 0. 


1 


T/OXFO 


If T/OXFO = 0, XFO is configured as a general-purpose input pin. 
If T/OXFO = 1, XFO is configured as a general-purpose output pin. 


2 


OUTXFO 


Data output on XFO. 


3 


INXFO 


Data input on XFO. A write has no effect. 


4 


Reserved 


Read as 0. 


5 


T/0XF1 


IfT/OXFI = 0, XF1 is configured as a general-purpose input pin. 
If T/0XF1 = 1, XF1 is configured as a general-purpose output pin. 


6 


OUTXFl 


Data output on XFl . 


7 


INXF1 


Data input on XFl . A write has no effect. 


8-31 


Reserved 


Read as 0. 



4.1.11 Repeat Counter (RC) and Block Repeat Registers (RS, RE) 

The repeat counter (RC) is a 32-bit register used to specify the number of 
times a block of code is to be repeated when performing a block repeat. 

The repeat start address register (RS) is a 32-bit register containing the start- 
ing address of the block of program memory to be repeated when operating 
in the repeat mode. 

The 32-bit repeat end address register (RE) contains the ending address of the 
block of program memory to be repeated when operating in the repeat mode. 

4.1.12 Program Counter (PC) 

The program counter (PC) is a 32-bit register containing the address of the 
next instruction to be fetched. While the program counter is not part of the 
CPU register file, it is a register that can be modified via instructions that mo- 
dify the program flow. 

4.1.13 Reserved Bits and Compatibility 

In order to retain compatibility with future members of the TMS320C3X family 
of microprocessors, reserved bits that are read as zero must be written as zero. 
Reserved bits that have an undefined value must not have their current value 
modified. In other cases, the user should maintain the reserved bits as speci- 
fied. 
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4.2 Memory 



The total memory space of the TMS320C30 is 16M (million) 32-bit words. 
Program, data, and I/O space are contained within this, allowing tables, co- 
efficients, program code, or data to be stored in either RAM or ROM. In this 
way, memory usage can be maximized and memory space allocated as desired. 

RAM blocks and 1 are each 1 K x 32 bits. The ROM block is 4K x 32 bits. 
Each RAM and ROM block is capable of supporting two accesses in a single 
cycle. The separate program buses, data buses, and DMA buses allow for 
parallel program fetches, data reads/writes, and DMA operations. This is cov- 
ered in detail in Section 10.3. 



4.2.1 Memory Maps 



The memory map is dependent upon whether the processor is running in the 
microprocessor mode (MC/MP = 0) or the microcomputer mode (MC/MP = 
1 ). The memory maps for these modes are very similar (see Figure 4-7). Lo- 
cations SOOOOOh through 801 FFFh are mapped to the expansion bus. When 
this region is accessed, MSTRB is active. Locations 802000h through 
803FFFh are reserved. Locations 804000h through 805FFFh are mapped to 
the expansion bus. When this region is accessed, lOSTRB is active. Locations 
806000h through 807FFFh are reserved. All of the memory-mapped periph- 
eral registers are in locations 808000h through 8097FFh. In both modes, 
RAM block is located at addresses 809800h through 809BFFh, and RAM 
block 1 is located at addresses 809C00h through 809FFFh. Memory locations 
SOAOO Oh through OFFFFFFh are accessed over the external memory port 
(STRB active). 

In microprocessor mode, the 4K on-chip ROM is not mapped into the 
TMS320C30 memory map. Locations Oh through 3Fh consist of interrupt 
vector, trap vector, and reserved locations, all of which are accessed over the 
external memory port (STRB active). Locations 40h through 7FFFFFh are also 
accessed over the external memory port. 

In microcomputer mode, the 4K on-chip ROM is mapped into locations Oh 
through OFFFh. There are 192 locations (Oh through BFh) within this block 
for interrupt vectors, trap vectors, and a reserved space. Locations lOOOh 
through 7FFFFFh are accessed over the external memory port (STRB active). 

Reserved portions of the TMS320C30 memory space and reserved peripheral 
bus addresses should not be read and written by the user. Doing so may 
cause the TMS320C30 to halt operation and require a system reset to restart. 
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Figure 4-7. Memory Maps 
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4.2.2 Peripheral Bus Map 



The memory-mapped peripheral registers are located starting at address 
808000h. The peripheral bus memory map is shown in Figure 4-8. Each pe- 
ripheral occupies a 16-word region of the memory map. Locations 80801 Oh 
through 80801 Fh and locations 808070h through 8097FFh are reserved. 



808000h 
80800Fh 


DMA CONTROLLER REGISTERS 
(16) 


80801 Oh 
80801 Fh 


RESERVED 
(16) 


808020h 
80802Fh 


TIMER REGISTERS 
(16) 


808030h 
80803Fh 


TIMER 1 REGISTERS 
(16) 


808040h 
80804Fh 


SERIAL PORTO REGISTERS 
(16) 


808050h 
80805Fh 


SERIAL PORT 1 REGISTERS 
(16) 


808060h 
80806Fh 


PRIMARY AND EXPANSION PORT 
REGISTERS (16) 


808070h 
8097FFh 


RESERVED 




Figure 4-8. Peripheral Bus Memory Map 



4.2.3 Reset/Interrupt/Trap Vector Map 

The addresses for the reset, interrupt, and trap vectors are Oh through 3Fh, as 
shown in Figure 4-9. The vectors stored in these locations are the addresses 
of the start of the respective reset, interrupt, and trap routines. For example, 
at reset, the contents of memory location Oh (the reset vector) are loaded into 
the PC and execution begins from that address. 

Traps 28-31 are reserved and should not be used by the user. 
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OOh 


RESET 


01 h 
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RINTO 
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XINTO 


08h 


RINTl 


09h 


TINTO 


OAh 


TINT1 


OBh 


DINT 


OCh 
1Fh 


RESERVED 


20h 


TRAPO 






3Bh 


TRAP 27 


3Ch 


TRAP 28 (Reserved) 


3Dh 


TRAP 29 (Reserved) 


3Eh 


TRAP 30 (Reserved) 


3Fh 


TRAP 31 (Reserved) 



Figure 4-9. Reset, Interrupt, and Trap Vector Locations 
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4.3 Instruction Cache 

A 64 X 32-bit instruction cache allows for maximum system performance with 
minimal system cost. The instruction cache stores sections of code that can 
be fetched when repeatedly accessing time-critical code. This greatly reduces 
the number of off-chip accesses necessary and allows for code to be stored 
off-chip in slower, lower-cost memories. The external buses are also freed 
from program fetches, so they can be used by the DMA or other system ele- 
ments. 

The cache can operate in a completely automatic fashion without the need for 
user intervention. A form of the LRU (least-recently-used) cache update al- 
gorithm is used (see Section 4.3.2). 

4.3.1 Cache Architecture 

The instruction cache (see Figure 4-10) contains 64 32-bit words of RAM. 
The cache is divided into two 32-word segments. Associated with each seg- 
ment is a 19-bit segment start address (SSA) register. For each word in the 
cache, there is a corresponding single-bit: Present (P) flag. 
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When the CPU requests an instruction word from external memory, a check 
is made to determine if the word is already contained in the instruction cache. 
The partitioning of an instruction address as used by the cache control algo- 
rithm is shown in Figure 4-11. The 19 most-significant bits of the instruction 
address are used to select the segment and the 5 least-significant bits define 
the address of the instruction word within the pertinent segment. The 19 
MSBs of the instruction address are compared with the two segment start 
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address (SSA) registers. If a match is found, a check is made of the relevant 
P flag. The P flag indicates whether or not the word within a particular seg- 
ment is already present in cache memory. 



23 5 4 


segment start address 
(SSA) 


instruction word 
address within segment 




Figure 4-11. Address Partitioning for Cache Control Algorithm 



If there is no match, one of the segments must be replaced by the new data. 
The segment replaced in this circumstance is determined by the LRU (least- 
recently-used) algorithm. The LRU stack (see Figure 4-10) is maintained for 
this purpose. 

The LRU stack tracks which of the two segments qualifies as the least-re- 
cently-used after each access to the cache, therefore the stack contains either 
0,1 or 1,0. Each time a segment is accessed, its segment number is removed 
from the LRU stack and pushed on the top of the LRU stack. Therefore, the 
number at the top of the stack is the most-recently-used segment number and 
the number at the bottom of the stack is the least-recently-used segment 
number. 

At system reset, the LRU stack is initialized with at the top, 1 at the bottom, 
and all P flags in the instruction cache are cleared. If both SSA registers are 
equal (due to system reset conditions) and a cache hit occurs, the instruction 
word is fetched from the most recently used segment. 

When a replacement is necessary, the least-recently-used segment is selected 
for replacement. Also, the 32 P flags for the segment to be replaced are set 
to 0, and the segment's SSA register is replaced with the 19 MSBs of the in- 
struction address. ,* 



4.3.2 Cache Algorithm 



When the TMS320C30 requests an instruction word from external memory, 
two possible actions occur: a cache hit or a cache miss. These are described 
in the following list: 

• Cache Hit. The requested instruction is contained within the cache 
and the following actions occur: 

1 ) The instruction word is read from the cache. 

2) The segment number of the segment within which the word is 
contained is removed from the LRU stack and pushed to the top 
of the LRU stack, thus moving the other segment number to the 
bottom of the stack. 

• Cache Miss. The instruction is not contained in the cache. Types of 
cache miss are: 

1) Word Miss. The segment address register matches the instruction 
address, but the relevant P flag is not set. The following actions 
occur in parallel: 
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- The instruction word is read from memory and copied into 
the cache. 

- The segment number of the segment within which the word 
is contained is removed from the LRU stack and pushed to 
the top of the LRU stack, thus moving the other segment 
number to the bottom of the stack. 

- The relevant P flag is set. 

2) Segment Miss. Neither of the segment addresses matches the in- 
struction address. The following actions occur in parallel: 

- The least- recently- used segment is selected for replacement. 
The P flags for all 32 words are cleared. 

- The SSA register for the selected segment is loaded with the 
19 MSBs of the address of the requested instruction word. 

- The instruction word is fetched and copied into the cache. 
It goes into the appropriate word of the least-recently-used 
segment. The P flag for that word is set 1 . 

- The segment number of the segment containing the instruc- 
tion word is removed from the LRU stack and pushed to the 
top of the LRU stack, thus moving the other segment number 
to the bottom of the stack. 

Only instructions may be fetched from the program cache. All reads and writes 
of data in memory bypass the cache. Program fetches from internal memory 
do not modify the cache and will not generate cache hits or misses. The pro- 
gram cache is a single-access memory block. Dummy program fetches (i.e., 
following a branch) are treated by the cache as valid program fetches and can 
generate cache misses and cache updates. 

Care should be taken when using self-modifying code. If an instruction re- 
sides in cache and the corresponding location in primary memory is modified, 
the copy of the instruction in cache is not modified. 

More efficient use of the cache can be made by aligning program code on 32 
word address boundaries. This can be done using the ALIGN directive when 
coding assembly language. 

4.3.3 Cache Control Bits 

Three cache control bits are located in the CPU status register: the cache clear 
bit (CC), cache enable bit (CE), and the cache freeze bit (CF). 

Cache Clear Bit (CC). Writing a 1 to the cache clear bit (CC) invalidates 
all entries in the cache. All P flags in the cache are cleared. The CC bit is al- 
ways cleared after the cache is cleared. It is therefore always read as a 0. At 
reset the cache is cleared and is written to this bit. 

Cache Enable Bit (CE). Writing a 1 to this bit enables the cache. When 
enabled, the cache is used according to the previously described cache algo- 
rithm. Writing a to the cache enable bit disables the cache; no updates or 
modification of the cache can be performed. Specifically, no SSA register 
updates are performed, no P flags are modified (unless CC = 1 ), and the LRU 
stack is not modified. Writing a 1 to CC when the cache is disabled will clear 
the cache, and thus the P flags. No fetches are made from the cache when the 
cache is disabled. At reset, is written to this bit. 
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Cache Freeze Bit (CF). When CF = 1, the cache is frozen. If, in addition, 
the cache is enabled, fetches from the cache are allowed, but no modification 
of the state of the cache is performed. Specifically, no SSA register updates 
are performed, no P flags are modified (unless CC = 1 ), and the LRU stack is 
not modified. This function can be used to keep frequently used code resident 
in the cache. Writing a 1 to CC when the cache is frozen will clear the cache, 
and thus the P flags. At reset, is written to this bit. 
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Table 4-6 defines the effect of the CE and CF bits used in combination. 
Table 4-6. Combined Effect of the CE and CF Bits 



CE 


CF 


EFFECT 





1 
1 




1 



1 


Cache not enabled 

Cache not enabled 

Cache enabled and not frozen 

Cache enabled and frozen 
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Section 5 

Data Formats and Floating-Point Operation 



Data is organized in the TMS320C30 architecture to provide three funda- 
mental data types: integer, unsigned-integer, and floating-point. Note that the 
terms, integer and signed-integer, are considered to be equivalent. The 
TMS320C30 supports short and single-precision formats for signed and un- 
signed integers. It also supports short, single-precision and extended- 
precision formats for floating-point data. 

Floating-point operations provide convenient and trouble-free computations 
while maintaining accuracy and precision. The TIVIS320C30 implementation 
of floating-point arithmetic allows for floating-point operations at integer 
speeds. The floating-point capability can prevent problems with overflow, 
operand alignment, and other burdensome tasks common in integer oper- 
ations. 

This section discusses in detail the data formats and floating-point operations 
supported on the TMS320C30. Major topics in this section are as follows: 

• Integer Formats (Section 5.1 on page 5-2) 

® Unsigned-Integer Formats (Section 5.2 on page 5-3) 

• Floating-Point Formats (Section 5.3 on page 5-4) 

© Floating-Point Multiplication (Section 5.4 on page 5-9) 

© Floating-Point Addition and Subtraction (Section 5.5 on page 5-13) 

© Normalization (Section 5.6 on page 5-17) 

© Rounding (Section 5.7 on page 5-20) 

© Floating-Point to Integer Conversions (Section 5.8 on page 5-22) 

© Integer to Floating-Point Conversions (Section 5.9 on page 5-24) 
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5.1 Integer Formats 



The TMS320C30 supports two integer formats: a 1 6-bit short integer format 
and a 32-bit single-precision integer format. When extended- precision regis- 
ters are used as integer operands only bits 31 -0 are used; bits 39-32 remain 
unchanged and unused. 



5.1.1 Short Integer Format 




The short integer format is a 1 6-bit two's-complement integer format, used for 
immediate integer operands. For those instructions that assume integer oper- 
ands, this format is sign-extended to 32 bits (see Figure 5-1 ). The range of 
an integer si, represented in the short integer format, is -2^^ <, si <, 2^° -1. 
In Figure 5-1, s=signed bit. 

15 



Short Integer Format 
31 16 15 



SSS5S5SSSSSS5S 



Sign Extension of a Short Integer 

Figure 5-1 . Short Integer Format and Sign Extension of Short 

Integer 



5.1.2 Single-Precision Integer Format 

In the single-precision integer format, the integer is represented in two's- 
complement notation. The range of an integer sp, represented in the single- 
precision integer format, is -2^'' ^ sp ^ 2^'^ -1. Figure 5-2 shows the 
single-precision integer format. 

31 



Figure 5-2. Single-Precision Integer Format 
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5.2 Unsigned-Integer Formats 

Two unsigned-integer formats are supported on the TMS320C30: a 16-bit 
short format and a 32-bit single-precision format. In extended-precision reg- 
isters, the unsigned-integer operands use only bits 31-0; bits 39-32 remain 
unchanged. 

5.2.1 Short Unsigned-Integer Format 

Figure 5-3 shows the 16-bit short unsigned-integer format, used for immedi- 
ate unsigned-integer operands. For those instructions that assume un- 
signed-integer operands, this format is zero-filled to 32 bits. In Figure 5-3 
below, X = MSB (1 orO). 

15 



Short Unsigned-Integer 
Format 
31 16 15 




00000000000000 


X 



Zero Fill of a Short Unsigned Integer 
Figure 5-3. Short Unsigned-Integer Format and Zero Fill 

5.2.2 Single-Precision Unsigned-Integer Format 

In the single-precision unsigned-integer format, the number is represented 
a 32-bit value, as shown in Figure 5-4. 

31 



as 



Figure 5-4. Single-Precision Unsigned-Integer Format 
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5.3 Floating-Point Formats 



All TMS320C30 floating-point formats consist of three fields: an exponent 
field (e), a single sign-bit field (s), and a fraction field (f). These are stored 
as shown in Figure 5-5. The exponent field is a two's-complement number. 
The sign field and fraction field may be considered as one unit and referred to 
as the mantissa field (man). The mantissa is used to represent a normalized 
two's-complement number. In a normalized representation, a most-signifi- 
cant nonsign bit is implied, thus providing an additional bit of precision. The 
value of a floating-point number x as a function of the fields e, s, and f is given 
as 




X = 01 ./ X 26 if 5 = 

^0.f X 2^ ifs = 1 

if e = most negative two's-complement value 

for the specified exponent field width. 



e 


s 


f 



U- 



man (mantissa) - 



Figure 5-5. Generic Floating-Point Format 



Three floating-point formats are supported on the TMS320C30. The first is a 
short floating-point format for immediate floating-point operands, consisting 
of a 4-bit exponent, 1 sign bit, and an 11 -bit fraction. The second is a sin- 
gle-precision format consisting of an 8-bit exponent, 1 sign bit, and a 23-bit 
fraction. The third is an extended-precision format consisting of an 8-bit ex- 
ponent, 1 sign bit, and a 31 -bit fraction. 

5.3.1 Short Floating-Point Format 

In the short floating-point format, floating-point numbers are represented by 
a two's-complement 4-bit exponent field (e) and a two's-complement 12-bit 
mantissa field {man) with an implied most-significant nonsign bit. 



15 12 1110 



u 



4 



Figure 5-6. Short Floating-Point Format 
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Operations are performed with an implied binary point between bits 1 1 and 
10. When the implied most-significant nonsign bit is made explicit, it is lo- 
cated to the immediate left of the binary point. The floating-point two's- 
complement number x in the short floating-point format is given by 



X = 0^.f X 2^ 

1 0./ X 26 





if5 = 
ifs = 1 
if e = -8, 5 = 0, /■ = 



The following reserved values must be used to represent zero in the short 
floating-point format: 

e= -8 

5 = 

/= 

The following examples illustrate the range and precision of the short float- 
ing-point format: 

Most Positive: x = (2 - 2-''l) x 2 ^ = 2.5594 x 10^ 

Least Positive: x = 1 x 2-^ = 7.8125 x 10 -^ 

Least Negative: x=(-1-2-''1) x 2 "^ = -7.8163 x lO'^ 

Most Negative: x = -1 y. 7? = -2.5600 x 10 ^ 




5.3.2 Single-Precision Floating-Point Format 

In the single-precision format, the floating-point number is represented by an 
8-bit exponent field (e) and a two's-complement 24-bit mantissa field {man) 
with an implied most-significant nonsign bit. 

Operations are performed with an implied binary point between bits 23 and 
22. When the implied most-significant nonsign bit is made explicit, it is lo- 
cated to the immediate left of the binary point. The floating-point number x 
is given by: 



X = 01 ./ X 2^ 
lO.f X 2^ 


31 



if5 = 
if5 = 1 
if e = -128, s = 0, /= 



24 23 22 



e 


s f 



man 



Figure 5-7. Single-Precision Floating-Point Format 



The following reserved values must be used to represent zero in the single- 
precision floating-point format: 

e = - 1 28 

5 = 

/=0 
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The following examples illustrate the range and precision of the single-preci- 
sion floating-point format. 

Most Positive: x = (2 - 2-23) ^2 ^27 = 3.4028234 x 10^8 

Least Positive: x = 1 x 2-''27 = 5.8774717 x 10 "39 

Least Negative: x = (-1-2-23) ^ 2 -'•27 = -5.8774724 x 10-39 

Most Negative: x = -2 x 2^27 = -3.4028236 x 10 38 

5.3.3 Extended-Precision Floating-Point Format 

In the extended-precision format, the floating-point number is represented by 
an 8-bit exponent field (e) and a 32-bit mantissa field {man) with an implied 
most-significant nonsign bit. 

Operations are performed with an implied binary point between bits 31 and 
30. When the implied most-significant nonsign bit is made explicit, it is lo- 
cated to the immediate left of the binary point. The floating-point number x 
is given by 

X = 01 .^ x 2^ if 5 = 

1 0.^ X 2^ if 5 = 1 

if e = -1 28, s = 0, f = 



39 32 31 30 



Figure 5-8. Extended-Precision Floating-Point Format 



The following reserved values must be used to represent zero in the extend- 
ed-precision floating-point format: 

e = - 1 28 

5 = 

f=0 

The following examples illustrate the range and precision of the extended- 
precision floating-point format: 

Most Positive: x = (2 - 2-31) x2 ^27 = 3.4028236683 x 1038 

Least Positive: x = 1 x 2-''27 = 5.8774717541 x 10-39 

Least Negative: x = (-1-2-31) ^ 1ATI ^ -5.8774717569 x 10*39 

Most Negative: x = -2 x 2''27 = -3.4028236691 x 10 38 
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5.3.4 Conversion Betv\/een Floating-Point Formats 

Floating-point operations assume several different formats for inputs and 
outputs. These formats often require conversion from one floating-point for- 
mat to another (e.g., short floating-point format to extended-precision float- 
ing-point format). Format conversions automatically occur in hardware, with 
no overhead, as a part of the floating-point operations. The four conversions 
are shown below with examples of the conversion. When a floating-point 
format zero is converted to a greater- precision format, it is always converted 
to a valid representation of zero in that format. In the below figures, S = sign 
bit of the exponent. 

• Short floating-point format conversion to single-precision 
floating-point format. 



15 



12 11 10 







S XX X 


y 


y y 




Short Floating-Point Format 



31 


27 24 


23 


22 


12 


11 





S 


SSSxxx X 


y 


y 


y 









Single-Precision Floating-Point Format 

In this format, the exponent field is sign-extended and the fraction field 
filled with zeros. 



Short floating-point format conversion to extended-precision 
floating-point format. 



15 



1211 10 
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Short Floating-Point Format 
39 35 32 31 30 20 19 



S SSSxxx X 


y 


y y 






Extended -Precision Floating-Point Format 

The exponent field in this format is sign-extended and the fraction field 
filled with zeros. 
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• Single-precision floating-point format conversion to extend- 
ed-precision floating-point format. 

31 24 23 22 



X X y y y 



39 



Single-Precision Floating-Point Format 
32 31 30 



8 7 



X X 


y 


y y 






Extended -Precision Floating-Point Format 
The fraction field is filled with zeros. 

• Extended-precision floating-point format conversion to sin- 
gle-precision floating-point format. 

39 32 31 30 8 7 



X X y y y z z 



31 



Extended- Precision Floating-Point Format 
24 23 22 



X X y y y 



Single-Precision Floating-Point Format 
The fraction field is truncated. 
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5.4 Floating-Point Multiplication 

A floating-point number a can be written in floating-point format as the fol- 
lowing formula, where a( man) is the mantissa and a(exp) is the exponent. 

a = a (ma/7) x 2^(^^P) 
The product of a and b is c, defined as 

c = a X b = a{man) x h{man) x 2 (a(exp)+b {exp)) 

c{man) = a{man) x b{man) 
c(exp) = a (exp) + b(exp) 

When performing floating-point multiplication, source operands are always 
assumed to be in the single-precision floating-point format. If the source of 
the operands is in short floating-point format, it is extended to the single- 
precision floating-point format. If the source of the operands is in extend- 
ed-precision floating-point format, it is truncated to single-precision format. 
These conversions automatically occur in hardware with no overhead. All re- 
sults of floating-point multiplications are in the extended-precision format. 
These multiplications occur in a single cycle. 

A flowchart for floating-point multiplication is shown in Figure 5-9. In step 
1, the 24-bit source operand mantissas are multiplied, producing a 50-bit re- 
sult c(man). (Note that input and output data are always represented as nor- 
malized numbers.) In step 2, the exponents are added, yielding c{exp). Steps 
3 through 6 check for special cases. Step 3 checks for whether c(man) in 
extended-precision format is equal to zero. If c{man) is zero, step 7 sets 
c(exp) to -128, thus yielding the representation for zero. 

Steps 4 and 5 normalize the result. If a right shift of one is necessary, then in 
step 8, c{man) is right-shifted one bit and one is added to c(exp). If a right 
shih of two is necessary, then in step 9, c{man) is right-shifted two bits and 
two is added to c(exp). Step 6 occurs when the result is normalized. 

In step 10, c{man) is set in the extended -precision floating-point format. 
Steps 11 through 18 check for special cases of c(exp). In step 14, if c(exp) 
has overflowed (step 1 1 ) in the positive direction, then c(exp) is set to the 
most-positive extended-precision format value. If c(exp) has overflowed in 
the negative direction, then c(exp) is set to the most-negative extended-pre- 
cision format value. If c(exp) has underflowed (step 12), then c is set to zero 
(step 15); i.e., c{man) = and c(exp) = -128. 




5-9 



Floating-Point Operations - Multiplication 



a(man) 



b(man) 

_Lo, 



Multiply mantissas 



c(man) = a(man) * b(man) 
(50-blt result) 



a(exp) t><axp) 

J L«. 



Add exponents 



c(exp) = 8(exp) -f b(exp) 



Test for special cases of c(man) 



(3) 
c(man) = 



(4) (5) (6) 

Right— shift 1 Right— shift 2 No shift 

to normalize to normalize to normalize 



" (7) 



c(exp) = 
- 128 




(8) 



c(m0n) > > 1 

and 

c(exp) = 

c(exp) + 1 



(9) 



c(man) >> 2 

and 

c(oxp) = 

c(exp) + 2 



Dispose of extra bits 



Put c(man) In extended 
precision floating-point 
format. 



(10) 



Test for special cases of c(exp) 



(11) 
c(exp) overflow 



(12) (13) 

c(exp) underflow c(exp) In range 



(14) 



If c(m8n) > 0, 
set c to most 
positive value. 

If c(m8n) < 0, 
set c to most 
negative value. 



c(oxp) = -128 
c(man) = 



(15) 



Set c to final result 



(16) 



c = a • b 

Figure 5-9. Flowchart for Floating-Point Multiplication 
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The following examples illustrate how floating-point multiplication is per- 
formed on the TMS320C30. For these examples, the implied most-significant 
nonsign bit is made explicit. 

Example 5-1. Floating-Point Multiply (Both Mantissas = -2.0) 

Let 

a = -2.0 X 23(e^^) = 1 0.00000000000000000000000 x 2 ^^^^P) 
b - -2.0 X 2'3(exp) = 10.00000000000000000000000 x l^^^^P) 

where a and b are both represented in binary form according to the normalized single-pre- 
cision floating-point format. Then 

10.00000000000000000000000 X i^^exp) 
X 10.00000000000000000000000 X l^^exp) 

0100.0000000000000000000000000000000000000000000000 X 2(a( ^^P) + b(exp)) 

To place this number in the proper normalized format, it is necessary to shift the mantissa two 
places to the right and add two to the exponent. This yields 

10.00000000000000000000000 X 2a(e^P) 
X 10.00000000000000000000000 X 2^(g^^) 

01.0000000000000000000000000000000000000000000000 X 2(a( ^^P) + b(ex/7)+2) 

In floating-point multiplication, the exponent of the result may overflow. This can occur 
when the exponents are initially added or when the exponent is modified during normaliza- 
tion. 

Example 5-2. Floating-Point Multiply (Both Mantissas = 1.5) 

Let 

a = 1 .5 X l^^^^P) = 01 .1 0000000000000000000000 x 2 ^(ex/?) 
b = 1 .5 X l^^^^P) = 01 .1 0000000000000000000000 X 2 ^(ex/?) 

where a and b are both represented in binary form according to the single-precision float- 
ing-point format. Then 

01.10000000000000000000000 X 2a(^^^) 
X 01.10000000000000000000000 X 2^(^^P^ 

0010.0100000000000000000000000000000000000000000000 X 2 (a(e^P) + b{exp)) 

To place this number in the proper normalized format, it is necessary to shift the mantissa 
one place to the right and add one to the exponent. This yields 

01 .1 0000000000000000000000 X 2a(e^P) 
X 01 .1 0000000000000000000000 X 2b(g^P) 

01.00100000000000000000000000000000000000000000000 X 2 (a(e>fP) +b(exp) +1) 
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Example 5-3. Floating-Point Multiply (Both Mantissas = 1.0) 

Let 

a = 1.0 x2a(6^P) = 01.00000000000000000000000 x l^^^^P) 
b = 1.0 x2^^^>(P) = 01.00000000000000000000000 X 2 '^(^^P) 

where a and b are both represented in binary form according to the single-precision float- 
ing-point format. Then 

01 .00000000000000000000000 X 2a(e^P) 
X 01.00000000000000000000000 X 2^^^^P) 

0001.0000000000000000000000000000000000000000000000 X 2 (a(e^P) + b(exp)) 

This number is in the proper normalized format. Therefore, no shift of the mantissa or mod- 
ification of the exponent is necessary. 

These examples have shown cases where the product of two normalized numbers can be 
normalized with a shift of zero, one, or two. For all normalized inputs with the floating-point 
format used by the TMS320C30, a normalized result can be produced by a shift of zero, one, 
or two. 

Example 5-4. Floating-Point Multiply Between Positive and Negative Numbers 

Let 

a = 1.0 x2a(^^P) = 01.00000000000000000000000 x 2 a^^^P) 
b = 2.0 xl^^^^P) = 10.00000000000000000000000 X 2b(6^P) 



Then 

01 .00000000000000000000000 X 2a(^^P) 
X 10.00000000000000000000000 X 2b(exp) 



1110.0000000000000000000000000000000000000000000000 X 2 (a(e^P) + b(exp)) 
The result is c = -2.0 x 2(a(e^P) + b(exp)) 

Example 5-5. Floating-Point Multiply by Zero 

All multiplications by a floating-point zero yield a result of zero {f = 0, s = 0, and exp = 
-128). 
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5.5 Floating-Point Addition and Subtraction 

In floating-point addition and subtraction, two floating-point numbers a and 
b can be defined as 

a = a{man) x 2a(e>fP) 
b = h{man) x 2b(e^P) 

The sum (or difference) of a and b can be defined as 

c = a ± b 
= {a{man) ± {b{man) x 2 -(a(exp)-b(ex/j)))) ^ 2 ^(e^P), 

if a(exp) >. b{exp) 
= {{a{man) x 2 -(b{exp)-a(exp))) ± b(^ man)) x 2 b(exp)^ 

if a{exp) < b{exp) 

The flowchart for floating-point addition is shown in Figure 5-10. Since this 
flowchart assumes signed data, it is also appropriate for floating-point sub- 
traction. In this figure, it is assumed that a(exp) ^ b{exp). In step 1, the 
source exponents are compared, and c(exp) is set equal to the largest of the 
two source exponents. In step 2, d is set to the difference of the two expo- 
nents. In step 3, the mantissa with the smallest exponent, in this case 
a{man), is right-shifted d bits in order to align the mantissas. After the man- 
tissas have been aligned, they are added (step 4). 

Steps 5 through 7 check for a special case of c{man). If c{man) is zero (step 
5), then c{exp) is set to its most-negative value (step 8) to yield the correct 
representation of zero. If c{man) has overflowed c (step 6), then In step 9, 
c{man) is right-shifted one bit and one is added to c{exp) In step 10, the 
result is normalized. In steps 11 and 12, special cases of c{exp) are tested. 
If c(exp) has overflowed, then c is set to the most-positive extended-precision 
value if it is positive; otherwise, it is set to the most-negative extended-preci- 
sion value. 
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b{man) 



Align mantissas 



a(man) = a(nMn) >> d 

Discard LSBs 
to keep a(man) In 
extended-precision 
floating-point format 



a(exp) 



b(exp) 



J L 



Compare exponents 



If a(exp) < = b(axp) 
c(exp) = b<exp) 

else 
c(exp) = a(exp) 

[Assume for simplicity 

that alexp) < < = b(exp)I 



Subtract exponents 



d = b(e<p) - a(exp) 



Add mantissas 



c(man) = a(man) + b(man) 




Test for special cases of c(man) 



(6) (7) 

k - # leading 
Overflow of c(man) non-significant 
sign bits 



c(man) = c(man) >> 1 
c(exp) = c(exp) + 1 
Discard LSBs to keep in 
extended-precision 
floating-point format 



c(exp) = -128 



(9) 



c(man) < < k 
c(exp) = c(exp) -k 



Test for special cases of c(exp) 



(11) (12) (13) 

c(sxp) overflow c(exp) underflow c(exp) in range 



(14) 



If c(man) > 0, 
set c to most 
positive value, 

If c(man) < 0, 
set c to most 
negative value. 

1 



set c to zero 
c(exp) = -128 
c(man) = 



"J 



(15) 



(16) 



Set c to final result 



Figure 5-10. Flowchart for Floating-Point Addition 
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The following examples describe the floating-point addition and subtraction 
operations. It is assumed that the data is in the extended-precision floating- 
point format. 

Example 5-6. Floating-Point Addition 

In the case of two normalized numbers to be summed, let 

a = 1.5 = 01.1000000000000000000000000000000 x 2° 
b = 0.5 = 01 .0000000000000000000000000000000 X 2-^ 

It is necessary to shift b to the right by one so that a and b have the same 
exponent. This yields 

b = 0.5 = 00.1 000000000000000000000000000000 X 2° 
Then 

01 .1 000000000000000000000000000000 X 2° 
+ 00.1000000000000000000000000000000 X 2 ° 

010.0000000000000000000000000000000 X 2° 

As in the case of multiplication, it is necessary to shift the binary point one 
place to the left and to add one to the exponent. This yields 

01.1000000000000000000000000000000 X 2° 
+ 00.1000000000000000000000000000000 X 2 ° 

01 .0000000000000000000000000000000 X 2^ 

Example 5-7. Floating-Point Subtraction 

A subtraction is performed in this example. Let 

a = 01 .0000000000000000000000000000001 X 2° 
b = 01 .0000000000000000000000000000000 X 2° 

The operation to be performed is a - b. The mantissas are already aligned since 
the two numbers have the same exponent. The result is a large cancellation 
of the upper bits, as shown below. 

01 .0000000000000000000000000000001 X 2° 

- 01 .0000000000000000000000000000000 X 2 Q 

00.0000000000000000000000000000001 X 2° 

The result must be normalized. In this case, a left-shift of 31 is required. The 
exponent of the result is modified accordingly. The result is 

01 .0000000000000000000000000000001 X 2° 

- 01.0000000000000000000000000000000 X 2 Q 

01.0000000000000000000000000000000 X 2-31 
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Example 5-8. Floating-Point Addition with a 32-Bit Shift 

This example illustrates a situation where a full 32-bit shift is necessary to 
normalize the result. Let 

a = 01.1111111111111111111111111111111 X 2^27 
b = 10.0000000000000000000000000000000 X 2127 

The operation to be performed is a + b. 

01.1111111111111111111111111111111 X 2''27 
+ 10.0000000000000000000000000000000 X 2 ^27 

11.1111111111111111111111111111111 X 2^27 

Normalizing the result requires a left-shift of 32 and a subtraction of 32 from 
the exponent. The result is 

01.1111111111111111111111111111111 X 2^27 
+ 10.0000000000000000000000000000000 X 2 ^27 

10.0000000000000000000000000000000 X 295 

Example 5-9. Floating-Point Addition/Subtraction and Zero 

When floating-point addition and subtraction is performed with a floating- 
point 0, , the following identities are satisfied: 

a ± = a (a ?£ 0) 

± = 

0-a = -a (a 9i 0) 
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5.6 Normalization Using the NORM Instruction 

The NORM instruction takes an extended -precision floating-point number, 
assumed to be unnormalized, and normalizes it. Since the number is assumed 
to be unnormalized, no implied most-significant nonsign bit is assumed. The 
NORM instruction executes the following three steps: 

1 ) Locate the most-significant nonsign bit of the floating-point number. 

2) Left-shift to normalize the number. 

3) Adjust the exponent. 

Given the extended-precision floating-point value a to be normalized, the 
normalization (norm ()) is performed as shown in Figure 5-11. 
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Test for special cases of a(man) 



(2) 
(1) Leading non-slgnlf leant 

a(nian) = sign bits. 



(3) 



c(exp) = -128 




(9) 



k = # leading 
non-signlflcant 
sign bits 



Sign-extended a(/nan) 1 bit 
c{man) = a{man) << k 
c(exp) = a(exp) - k 



Remove most-slgnlflcant non-sign bit 



Test for special cases of c(exp) 



(6) (7) 

c(exp) underflow c(exp) in range 



(8) 



c(exp) = -128 

No change to c(man) 



Set c to final result 



(4) 



(5) 



c = norm(a) 



Figure 5-11. Flowchart for NORM Instruction Operation 
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Example 5-10. NORM Instruction 



Assume that an extended-precision register contains the value 
man = 00000000000000000001 000000000001 , exp = 

When the normalization is performed on a number assumed to be unnormal- 
ized, the binary point is assumed to be 

man = 0.0000000000000000001000000000001, exp = 

This number is then sign-extended one bit so that the mantissa contains 33 
bits. 

man = 00.0000000000000000001000000000001, exp = 

The intermediate result after the most-significant nonsign bit is located and 
the shift performed is 

man = 01.0000000000010000000000000000000, exp = -19 

The final 32-bit value output after removing the redundant bit is 

man = 00000000000010000000000000000000, exp = -19 

The NORM instruction is useful for counting the number of leading zeros or 
leading ones in a 32-bit field. If the exponent is initially zero, the absolute 
value of the final value of the exponent is the number of leading ones or zeros. 
This Instruction is also useful for manipulating unnormalized floating-point 
numbers. 
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5.7 Rounding: The RND Instruction 

The RND instruction rounds a number from the extended -precision floating- 
point format to the single-precision floating-point format. Rounding is similar 
to floating-point addition. Given the number a to be rounded, the following 
operation is performed first. 

c = a{man) x 2^^^^P) -^ (1 x 2 (a(exp)-24)) 

Next a conversion from extended -precision floating-point to single-precision 
floating-point format is performed. Given the extended-precision floating- 
point value, the rounding (rnd()) is performed as shown in Figure 5-12. 
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^ - a(exp) -24 
a 1x2*' 

i L 



Add a(man) and 1/2 an LSB 



c(man) = a(man) +-24 



Test for special cases of c(nian) 



c(man) = Overflow of c(man) No special case 



c(exp) = -128 



c(fnan) = c(fnan) > > 1 
c(exp) = a(exp) + 1 



Test for special cases of c(exp) 



c(exp) overflow c(exp) in range 



If c(man) > 0, 
set c to most- 
positive single- 
precision value. 

If c(man) < 0, 
set c to most- 
negative single- 
precision value. 




Set 8 LSBs of c(man) to zero 



c = md(a) 

Figure 5-12. Flowchart for Floating-Point Rounding by the RND 

Instruction 
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5.8 Floating-Point to Integer Conversion 

Floating-point to integer conversion, using the FIX instructions, allow ex- 
tended-precision floating-point numbers to be converted to single-precision 
integers in a single cycle. The floating-point to integer conversion of the value 
X will be referred to here as fix(x). The conversion will not overflow if a, the 
number to be converted, is in the range: 

-231 ^ a ^ 231 - 1 

First, it is necessary to be certain that 
a(exp) <, 30 

If these bounds are not met, an overflow occurs. If an overflow occurs in the 
positive direction, the output is the most positive integer. If an overflow oc- 
curs in the negative direction, the output is the most negative integer. If 
a(exp) is within the valid range, then a{man), with implied bit included, is 
sign-extended and right-shifted (rs) by the amount 

rs = 31 - a(exp) 

This right-shift (rs) shifts out those bits corresponding to the fractional part 
of the mantissa. For example: 

If ^ X < 1, then fix(x) = 0. 
If -1 ^ X < 0, then fix(x) = -1. 

The flowchart for the floating-point to integer conversion is shown in Figure 
5-13. 
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Test for speciai cases of a(exp) 


a(exp)>30 a(exp) in range 
^ '^' rs = 31 - a(exp) 


'1 




\ 


' 


Overfiow 




Shift 


If a(man) > 0, 

c = most positive 

integer. 

if a(man) < 0, 

c = most negative 

integer^ 


= a(man) > > rs 




















1' 


M 






Set c to final result 








1 


1 










c = fix(a) 

Figure 5-13. Flowchart for Floating-Point to Integer Conversion 
by FIX Instructions 
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5.9 Integer to Floating-Point Conversion Using the FLOAT 
Instruction 

Integer to floating-point conversion, using the FLOAT instruction, allows sin- 
gle-precision integers to be converted to extended -precision floating-point 
numbers. The flowchart for this conversion is shown in Figure 5-14. 




c(man) = a 
c(exp) = 30 



Test for special cases of c(man) 



Leading non-significant 
c(man) = sign bits. 



c(exp) = -128 



k =# leading 
non-significant 
sign bits 



c(man) = c(man) < < Ic 
c(exp) = 30 - l< 



Remove most significant 
non-sign bit. 



Set c to final result 



c = float (a) 



Figure 5-14. Flowchart for Integer to Floating-Point Conversion 
Using the FLOAT Instruction 
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Hardware Applications 



Section 6 

Addressing 



The TMS320C30 supports five groups of powerful addressing modes. Six 
types of addressing may be used within the groups, which allow access of 
data from memory, registers, and the instruction word. This section details the 
operation, encoding, and implementation of the addressing modes. Also dis- 
cussed is the management of system stacks, queues, and deques in memory. 
The major topics in this section are: 

9 Types of Addressing (Section 6.1 on page 6-2) 

- Register 

- Direct 

- Indirect 

- Short-immediate 

- Long-immediate 
PC-relative 

• Groups of Addressing Modes (Section 6.2 on page 6-18) 

- General addressing modes 

- Three-operand addressing modes 

- Parallel addressing modes 

- Long-immediate addressing mode 

- Conditional-branch addressing modes 

© Circular Addressing (Section 6.3 on page 6-22) 

9 Bit- Reversed Addressing (Section 6.4 on page 6-26) 

• System Stack Management (Section 6.5 on page 6-27) 
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6.1 Types of Addressing 



Six types of addressing allow access of data from memory, registers, and the 
instruction word. They are: 

• Register 

• Direct 

• Indirect 

• Short-immediate 

• Long-immediate 

• PC-relative 

Some types of addressing are appropriate for some instructions and not others. 
For this reason, the types of addressing are used in the five different groups 
of addressing modes as follows: 

• General addressing modes (G): 

- Register 

- Direct 

- Indirect 

- Short-immediate 

• Three-operand addressing modes (T): 

- Register 

- Indirect 

• Parallel addressing modes (P): 

- Register 

- Indirect 

• Long-immediate addressing mode 

- Long-immediate 

• Conditional-branch addressing modes (B): 

- Register 
PC-relative 

The six types of addressing will be discussed first, followed by the five groups 
of addressing modes. 



6.1.1 Register Addressing 



In register addressing, the operand is contained in a CPU register, as shown 
in the example below. 

ABSF Rl ; Rl = |R1| 

The syntax for the CPU registers, the assembler syntax, and the assigned 
function for those registers are listed in Table 6-1 . 
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Table 6-1. CPU Register/Assembler Syntax and Function 



CPU REGISTER 


ASSEMBLER 


ASSIGNED 


ADDRESS 


SYNTAX 


FUNCTION 


OOh 


RO 


Extended-precision register 


01 h 


R1 


Extended-precision register 


02h 


R2 


Extended-precision register 


03 h 


R3 


Extended-precision register 


04 h 


R4 


Extended-precision register 


05h 


R5 


Extended-precision register 


06h 


R6 


Extended-precision register 


07h 


R7 


Extended-precision register 


08h 


ARO 


Auxiliary register 


09h 


AR1 


Auxiliary register 


OAh 


AR2 


Auxiliary register 


OBh 


AR3 


Auxiliary register 


OCh 


AR4 


Auxiliary register 


ODh 


AR5 


Auxiliary register 


OEh 


AR6 


Auxiliary register 


OFh 


AR7 


Auxiliary register 


lOh 


DP 


Data page pointer 


llh 


IRO 


Index register 


12h 


IR1 


Index register 1 


13h 


BK 


Block size 


14h 


SP 


Active stack pointer 


15h 


ST 


Status register 


16h 


IE 


CPU/DMA interrupt enable 


17h 


IF 


CPU interrupt flags 


18h 


lOF 


I/O flags 


19h 


RS 


Repeat start address 


lAh 


RE 


Repeat end address 


IBh 


RC 


Repeat counter 




6.1.2 Direct Addressing 



In direct addressing, the data address is formed by the concatenation of the 
eight least-significant bits of the data page pointer (DP) with the 16 least- 
significant bits of the instruction word (expr). This results in 256 pages (64 
K words per page), giving the programmer a large address space without 
needing to change the page pointer. The syntax and operation for direct ad- 
dressing are listed below. 

Syntax: @expr 

Operation: address = DP concatenated with expr 

Figure 6-1 shows the formation of the data address. Example 6-1 gives an 
instruction example with data before and after instruction execution. 
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DP- 



Instruction 

Word ^ 


31 






16 


15 











expr 


31 




8 7 




'f 




X X. . .X 


X 


page 








31 


24 23" 









0. 


.0 


address 




31 


'' 









operand 



Figure 6-1. Direct Addressing 




Example 6-1. Direct Addressing 

ADDI @0BCDEh,R7 

Before Instruction: 

DP = 8Ah 

R7 = 0h 

Data at SABCDEh = 12345678h 



After Instruction: 

DP = 8Ah 

R7 = 12345678h 

Data at 8ABCDEh = 12345678h 



6.1.3 Indirect Addressing 



Indirect addressing is used to specify the address of an operand in memory 
through the contents of an auxiliary register, optional displacements, and in- 
dex registers. Only the 24 least-significant bits of the auxiliary registers and 
index registers are used in indirect addressing. This arithmetic is performed 
by the auxiliary register arithmetic units (ARAUs) on these lower 24 bits and 
is unsigned. The upper eight bits are unmodified. 

The flexibility of indirect addressing is possible because the ARAUs on the 
TMS320C30 are used to modify auxiliary registers in parallel with operations 
within the main CPU. Indirect addressing is specified by a five-bit field in the 
instruction word, referred to as the mod field. A displacement is either an ex- 
plicit unsigned 8-bit integer contained in the instruction word or an implicit 
displacement of one. Two index registers, IRO and IR1, can also be used in 
indirect addressing. In some cases, an addressing scheme using circular or 
bit reversed addressing is optional. The mechanism for generating addresses 
in circular addressing is discussed in Section 6.3, bit reversed in Section 6.4. 

Table 6-2 lists the various kinds of indirect addressing, along with the value 
of the modification (mod) field, assembler syntax, operation, and function for 
each. The succeeding 18 examples show the operation for each kind of indi- 
rect addressing. 
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Table 6-2. indirect Addressing 



MOD FIELD 


SYNTAX 


OPERATION 


DESCRIPTION 


INDIRECT ADDRESSING WITH DISPLACEMENT 


00000 


*+ARn(disp) 


addr = ARn + disp 


With pre-displacement add 


00001 


*-ARn(disp) 


addr = ARn - disp 


With pre-displacement subtract 


00010 


* ++ARn(disp) 


addr = ARn + disp 
ARn = ARn + disp 


With pre-displacement add and modify 


00011 


*--ARn(disp) 


addr = ARn - disp 
ARn = ARn - disp 


With pre-displacement subtract and 
modify 


00100 


*ARn + + (disp) 


addr = ARn 

ARn = ARn + disp 


With post-displacement add and 
modify 


00101 


*ARn--(disp) 


addr = ARn 

ARn = ARn - disp 


With post-displacement subtract and 
modify 


00110 


*ARn + + (disp)% 


addr = ARn 

ARn = circ(ARn + disp) 


With post-displacement add and 
circular modify 


00111 


*ARn--(disp)% 


addr + ARn 

ARn = circ(ARn - disp) 


With post-displacement subtract and 
circular modify 


INDIRECT ADDRESSING WITH INDEX REGISTER IRO 


01000 


*+ARn(IRO) 


addr = ARn + IRO 


With pre-index (IRO) add 


01001 


*-ARn(IRO) 


addr = ARn - IRO 


With pre-index (IRO) subtract 


01010 


• + +ARn(IRO) 


addr = ARn + IRO 
ARn = ARn + IRO 


With pre-index (IRO) add and modify 


01011 


*--ARn(IRO) 


addr = ARn - IRO 
ARn = ARn - IRO 


With pre-index (IRO) subtract and 
modify 


01100 


*ARn + + (IRO) 


addr = ARn 

ARn = ARn + IRO 


With post-index (IRO) add and modify 


01101 


*ARn--(IRO) 


addr = ARn 
ARn = ARn - IRO 


With post-index (IRO) subtract and 
modify 


01110 


*ARn + + (IRO)% 


addr = ARn 

ARn = circ(ARn + IRO) 


With post-index (IRO) add and 
circular modify 


01111 


*ARn--(IRO)% 


addr = ARn 

ARn = circ(ARn - IRO) 


With post-index (IRO) subtract and 
circular modify 



LEGEND: 




addr 


= memory address 


ARn 


= auxiliary register ARO - AR7 


IRn 


= index register IRO or IR1 


disp 


= displacement 


+ + 


= add and modify 


-- 


= subtract and modify 


circO 


= address in circular addressing 


% 


= where circular addressing is performed 


B 


= where bit-reversed addressing is performed 
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Table 6-2. Indirect Addressing (Concluded) 



MOD FIELD 


SYNTAX 


OPERATION DESCRIPTION 


INDIRECT ADDRESSING WITH INDEX REGISTER IR1 


10000 


•+ARn(IR1) 


addr = ARn + IR1 


With pre- index (IR1) add 


10001 


*-ARn(IR1) 


addr = ARn - IR1 


With pre-index (IR1) subtract 


10010 


* + +ARn(IR1) 


addr = ARn + |R1 
ARn = ARn + IR1 


With pre-index (IR1) add and modify 


10011 


•--ARn(IRI) 


addr = ARn - IR1 
ARn = ARn - IR1 


With pre-index (IRI ) subtract and 
modify 


10100 


•ARn + + (IR1) 


addr = ARn 
ARn = ARn + IR1 


With post-index (IRI) add and modify 


10101 


*ARn--(IR1) 


addr = ARn 
ARn = ARn - IR1 


With post-index (IRI) subtract and 
modify 


10110 


*ARn + + (IR1)% 


addr = ARn 

ARn = circ(ARn + IR1) 


With post-index (IRI) add and 
circular modify 


10111 


•ARn--(IR1)% 


addr = ARn 

ARn = circ(ARn - IR1) 


With post-index (IRI) subtract and 
circular modify 


INDIRECT ADDRESSING (SPECIAL CASES) 


11000 


•ARn 


addr = ARn 


Indirect 


11001 


*ARn + + (IRO)B 


addr = ARn 

ARn = B(ARn + IRO) 


With post-index (IRO) add and 
bit-reversed modify 



LEGEND: 

addr = memory address 

ARn = auxiliary register ARO - AR7 

IRn = index register IRO or IRI 

disp = displacement 

+ + = add and modify 

= subtract and modify 

circ() = address in circular addressing 

% = where circular addressing is performed 

B = where bit-reversed addressing is performed 
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Example 6-2. Auxiliary Register indirect 

The address of the operand to be fetched is contained in an auxiliary register (ARn). 
Operation: operand address = ARn 

Assembler Syntax: *AR/7 

Modification Field: 11000 



ARn- 



31 


24 23 







X 


X 


address 


31 




" 





operand 
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Example 6-3. Indirect with Pre-Displacement Add 



The address of the operand to be fetched is the sum of an auxiliary register (ARn) and the 
displacement (disp). The displacement is either an eight-bit unsigned integer contained in 
the instruction word or an implied value of 1 . 



Operation: 
Assembler Syntax: 
Modification Field: 



operand address = AR/7+disp 

*+ARn(disp) 

00000 




31 



disp 



31 



ARn- 



0...0 



31 



24 23 



address 



8 7 



integer 



•1 + ) 



operand 



Example 6-4. Indirect with Pre-Displacement Subtract 



The address of the operand to be fetched is the contents of an auxiliary register (ARn) minus 
the displacement (disp). The displacement is either an eight-bit unsigned integer contained 
in the instruction word or an implied value of 1 . 



Operation: 
Assembler Syntax: 
Modification Field: 

ARn 



31 



24 23 



operand address = AR/7-disp 

*-ARn(disp) 

00001 



address 



31 
disp 



0...0 



31 



8 7 



integer 



operand 



■(-) 
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Example 6-5. Indirect with Pre-Displacement Add and Modify 



The address of the operand to be fetched is the sum of an auxiliary register (ARn) and the 
displacement (disp). The displacement is either an eight-bit unsigned integer contained in 
the instruction word or an implied value of 1 . After the data is fetched, the auxiliary register 
is updated with the address generated. 



Operation: 

Assembler Syntax: 
Modification Field: 

ARn— e 

31 



operand address = AR/j+disp 

AR/7=AR/7+disp 

* + +AR/7(disp) 

00010 



31 



24 23 



address 



8 7 



disp 



0...0 



integer 



.{ + ) 




31 



operand 



Example 6-6. Indirect with Pre-Displacement Subtract and Modify 

The address of the operand to be fetched is the contents of an auxiliary register (ARn) minus 
the displacement (disp). The displacement is either an eight-bit unsigned integer contained 
in the instruction word or an implied value of 1 . After the data is fetched, the auxiliary reg- 
ister is updated with the address generated. 



Operation: 

Assembler Syntax: 
Modification Field: 

ARn— • 



31 



24 23 



operand address = AR/7-disp 
AR/7=AR/7+disp 

*--AR/7(disp) 

00010 



address 



31 
disp 



0...0 



8 7 



integer 



.(-) 



31 



operand 
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Example 6-7. Indirect with Post-Displacement Add and Modify 



The address of the operand to be fetched is the contents of an auxiliary register (ARn). After 
the operand is fetched, the displacement (disp) is added to the auxiliary register. The dis- 
placement is either an eight-bit unsigned integer contained in the instruction word or an 
implied value of 1. 



Operation: 

Assembler Syntax: 
Modification Field: 



operand address = AR/7 
ARA7=AR/7+disp 

*AR/7+ + (disp) 

00100 




31 



ARn- 



disp 



31 



24 23 



address 



31 8 


7 


0...0 


integer 



■{+)• 



operand 



" 



Example 6-8. Indirect with Post-Displacement Subtract and Modify 

The address of the operand to be fetched is the contents of an auxiliary register (ARn). After 
the operand is fetched, the displacement (disp) is subtracted from the auxiliary register. The 
displacement is either an eight-bit unsigned integer contained in the instruction word or an 
implied value of 1. 



Operation: 

Assembler Syntax: 
Modification Field: 



31 



ARn- 



31 



disp 



31 



operand address = AR/7 
ARn=AR/7-disp 

*AR/7--(disp) 

00101 



24 23 



address 



8 7 



0...0 


integer 



-(-)- 



W 



operand 
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Example 6-9. Indirect with Post-Displacement Add and Circular Modify 

The address of the operand to be fetched is the contents of an auxiliary register (ARn). After 
the operand is fetched, the displacement (disp) is added to the contents of the auxiliary re- 
gister using circular addressing. This result is used to update the auxiliary register. The 
displacement is either an eight-bit unsigned integer contained in the instruction word or an 
implied value of 1. 



Operation: 

Assembler Syntax: 

Modification Field: 
31 



operand address = ARn 
AR/7=circ(AR/7+disp) 

*AR/7 + + (disp)% 

00110 



24 23 





ARn— » 


x 


X 


address 


31 






8 7 





f 

(%) 
I 







0...0 





integer 


^{ + ).^-H 






31 






' 






operand 




Example 6-10. Indirect with Post-Displacement Subtract and Circular Modify 



The address of the operand to be fetched is the contents of an auxiliary register (ARn). After 
the operand is fetched, the displacement (disp) is subtracted from the contents of the auxil- 
iary register using circular addressing. This result is used to update the auxiliary register. 
The displacement is either an eight-bit unsigned integer contained in the instruction word 
or an implied value of 1 . 



Operation: 

Assembler Syntax: 

Modification Field: 
31 



operand address = AR/7 
ARn=circ(ARA7-disp) 

*AR/7--(disp)% 

00111 



24 23 





ARn— » 


X 


X 


address 


31 






8 7 





1 







0...0 





integer 


»-{-)-*-< 


) 






31 






1 


' 






operand 
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Example 6-11. Indirect with Pre-lndex Add 



The address of the operand to be fetched is the sum of an auxiliary register (ARn) and an 
index register (IRO or IR1 ). generated. 




Operation: 
Assembler Syntax: 
Modification Field: 



31 



IRm. 



operand address = ARn+IRm 
*+AR/7(IRm) 



01000 
10000 



if m=0 
if m = 1 



24 23 





ARn — ^ 


X 


X 


address 


31 


24 23 















X 


X 


index 


^{ + ) 








31 






' ' 









operand 



Example 6-12. Indirect with Pre-lndex Subtract 



The address of the operand to be fetched is the difference of an auxiliary register (ARn) and 
an index register (IRO or IR1 ). 



Operation: 
Assembler Syntax: 
Modification Field: 



IRm. 



operand address = ARn-IRm 
*-AR/7(IRm) 



01001 
10001 



if m=0 
if m=1 







31 


24 23 











ARn— » 


X 


X 


address 


31 


24 23 










index 




^i- 


-) 


31 






' 





operand 
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Example 6-13. Indirect with Pre-lndex Add and Modify 



The address of the operand to be fetched is the sum of an auxiliary register (ARn) and an 
index register (IRO or IR1). After the data is fetched, the auxiliary register is updated with 
the address generated. 



Operation: 



operand address = AR/7+IRm 
AR/7=AR/7+IRm 



Assembler syntax: 
Modification Field: 



31 



IRm. 



' + +ARn{\Hm) 




01010 


if m=0 


10010 


if m=1 



24 23 





ARn— «• 


X 


X 


address 


31 


24 23 











a 


x 


x 


index 


— ^( + ) 
1 












1 










31 


■ 


1 









operand 




Example 6-14. indirect with Pre-lndex Subtract and Modify 



The address of the operand to be fetched is the difference of an auxiliary register (ARn) and 
an index register (IRO or IR1). The resulting address becomes the new contents of the 
auxiliary register. 



Operation: 



operand address = AR/7-IRm 
AR/7=AR/7-IRm 



Assembler Syntax: 
Modification Field: 



31 



IRm. 



*--AR/7(IRm) 




01011 


if m=0 


10011 


if m = 1 



24 23 





ARn— ♦ 


X 


X 


address 


31 


24 23 











/ 


I 


x 


X 


index 




-) 












i 


k 








31 


' 


f 









operand 
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Example 6-15. Indirect with Post-Index Add and Modify 




The address of the operand to be fetched is the contents of an auxiliary register (ARn). After 
the operand is fetched, the index register (IRO or IR1 ) is added to the auxiliary register. 

Operation: 



Assembler Syntax: 
Modification Field: 



operand address = 
ARn=ARA7+IR/n 


= AR/7 


*ARn+ + (IRm) 




01100 
10100 


if m=0 
if m=1 



31 



24 23 



IRm 





ARn— ^ 


X 


X 


address 


31 


24 23 









ii 




X 


X 


index 


»-( + ).«-( 


► 






31 






' 






operand 



Example 6-16. Indirect with Post-Index Subtract and Modify 



The address of the operand to be fetched is the contents of an auxiliary register (ARn). After 
the operand is fetched, the index register (IRO or IR1) is subtracted from the auxiliary regis- 
ter. 



Operation: 

Assembler Syntax: 
Modification Field: 



IRm. 



31 



operand address = ARn 
AR/7=ARn-IRm 

•ARn"(IRm) 



01101 
10101 



if m=0 
if m=1 



24 23 





ARn— » 


X 


X 


address 


31 


24 23 









il 






X 


X 


index 


— ^1- 


t 






31 








' 






operand 
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Example 6-17. Indirect with Post-Index Add and Circular Modify 

The address of the operand to be fetched is the contents of an auxiliary register (ARn). After 
the operand is fetched, the index register (IRO or IR1 ) is added to the auxiliary register. This 
value is evaluated using circular addressing and replaces the contents of the auxiliary register. 



Operation: 

Assembler Syntax: 
Modification Field: 



operand address = AR/7 
AR/;=circ(AR/7+IR/77) 

*AR/7+ + (|Rm)% 

01110 
10110 



if m=0 
if m=1 



31 



24 23 



IRm. 





ARn— » 


X 


X 


address 


31 


24 23 









1 

(%) 
I 




x 


x 


index 


»-( + )-*H 


i 






31 













operand 




Example 6-18. Indirect with Post-Index Subtract and Circular Modify 



The address of the operand to be fetched is the contents of an auxiliary register (ARn). After 
the operand is fetched, the index register (IRO or IR1) is subtracted from the auxiliary regis- 
ter. This value is evaluated using circular addressing and replaces the contents of the auxil- 
iary register. 



Operation: 



Assembler Syntax: 



Modification Field: 



31 



IRm. 



operand address = ARn 
AR/7=circ(AR/7-IRm) 

*AR/7--(IRm)% 



01111 
10111 



if m=0 
if m=1 



24 23 





ARn— » 


X 


X 


address 


31 


24 23 





f 

(%) 
1 




X 


X 


index 


»-{-)-*-< 


► 






31 






' 


f 






operand 
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Example 6-19. Indirect with Post-Index Add and Bit-Reversed Modify 

The address of the operand to be fetched is the contents of an auxiliary register (ARn). After 
the operand is fetched, the index register (IRO) is added to the auxiliary register. This addi- 
tion is performed with a reverse-carry propagation and can be used to yield a bit-reversed 
(B) address. This value replaces the contents of the auxiliary register. 



Operation: 

Assembler Syntax: 
Modification Field: 

ARn— • 



operand address = ARn 
ARn=B{ARA7+IR0) 

*AR/7+ + (|R0)B 

11001 



31 



24 23 



address 



31 



24 23 




IRm. 



X X 


index 



(B) 

I 

-( + ). 



31 



" 



operand 



6.1.4 Short-Immediate Addressing 

In short-immediate addressing, the operand is a 16-bit immediate value con- 
tained in the 16 least-significant bits of the instruction word (expr). De- 
pending upon the data types assumed for the instruction, the short-immediate 
operand may be a two's-complement integer, an unsigned integer, or a float- 
ing-point number. The syntax for this mode is listed below. 



Syntax: 



expr 



Example 6-20 gives an instruction example with before and after instruction 
data. 

Example 6-20. Short-Immediate Addressing 

SUBI 1,R0 



Before Instruction: 

RO = Oh 



After Instruction: 

RO = OFFFFFFFFh 
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6.1.5 Long-Immediate Addressing 

In long-immediate addressing, the operand is a 24-bit immediate value con- 
tained in the 24 least-significant bits of the instruction word (expr). The syn- 
tax for this mode is listed below. 

Syntax: expr 

Example 6-21 gives an instruction example with before and after instruction 
data. 

Example 6-21. Long-Immediate Addressing 

BR 8000h 

Before Instruction: After Instruction: 

PC = Oh PC = 8000h 



6.1.6 PC-Relative Addressing 

PC-relative addressing is used for branching. It replaces the value of the PC 
based upon the contents of the 1 6 least significant bit of the instruction word. 
The assembler takes the src (a label or address) specified by the user and 
generates a displacement. If the branch is a standard branch, this displace- 
ment is equal to the label - (PC+1). If the branch is a delayed branch, this 
displacement is equal to the label - (PC+3). 

The displacement is stored as a 16-bit signed integer in the least significant 
bits of the instruction word. 

Syntax: expr 

Example 6-22 gives an instruction example with before and after instruction 
data. 

Example 6-22. PC-Relative Addressing 

BU NEWPC ; pc=l, NEWPC=5 , displacement=3 

Before Instruction: After Instruction: 

PC = 1h PC = 5h 
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6.2 Groups of Addressing Modes 

The types of addressing are used to form the following five groups of ad- 
dressing modes: 

General addressing modes (G) 
Three-operand addressing modes (T) 
Parallel addressing modes (P) 
Long-immediate addressing mode 
Conditional-branch addressing modes (B) 

hese groups of addressing modes are discussed in the following sections. 



6.2.1 General Addressing Modes 




Instructions that use the general addressing modes are general-purpose in- 
structions, such as ADDI, MPYF, and LSH. Such instructions are usually of 
the form: 

dst operation src -* dst 

where the destination operand is signified by dst, the source operand by src, 
and 'operation' defines an operation to be performed using the general ad- 
dressing modes to specify certain operands. Bits 31 -29 are zero, indicating 
general addressing mode instructions. Bits 22 and 21 specify the general ad- 
dressing mode (G) field, which defines how bits 15 through are to be in- 
terpreted for addressing the src operand. 

Options for bits 22 and 21 (G field) are as follows: 

register (all CPU registers unless specified otherwise) 

1 direct 

1 indirect 
1 1 immediate 

If the src and dst fields contain register specifications, the value in these fields 
contains the CPU register addresses as defined by Table 6-1 . For the general 
addressing modes, the following values of ARn are valid: 

ARn, < n < 7 

Figure 6-2 shows the encoding for the general addressing modes. The nota- 
tion 'modn' indicates the modification field that goes with the ARn field. Refer 
to Table 6-2 for further information. 



31 29 28 




23 


22 21 


20 




1615 1110 87 


54 








operation 





dst 


0000000000 


1 src 


1 





operation 


1 


dst 


direct 1 





operation 


1 


dst 


modn 1 ARn | 


disp 







operation 


1 1 


dst 


immediate | 



I G I Destii 



Source Operands 



Figure 6-2. Encoding for General Addressing Modes 
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6.2.2 Three-Operand Addressing Modes 

Instructions that use the three-operand addressing modes, such as ADDIS, 
LSH3, CMPF3. or X0R3, are usually of the form: 

SRC1 operation SRC2 -^ dst 

where the destination operand is signified by dst, the source operands by 
SRC1 and SRC2, and 'operation' defines an operation to be performed. Note 
that the '3' can be omitted from three-operand instructions. 

Bits 31-29 are set to the value of 001, indicating three-operand addressing 
mode instructions. Bits 22 and 21 specify the three-operand addressing mode 
(T) field, which defines how bits 15-0 are to be interpreted for addressing the 
SRC operands. Bits 15-8 are used to define the SRC1 address, and bits 7-0 
to define the SRC2 address. Options for bits 22 and 21 (T) are as follows: 



SRC1 



SRC2 



00 


Register 


Register 


1 


Indirect 


Register 


1 


Register 


Indirect 


1 1 


Indirect 


Indirect 




Figure 6-3 shows the encoding for three-operand addressing. If the SRC1 
and SRC2 fields use the same auxiliary register, both addresses are correctly 
generated. However, only the value created by the SRC1 field is saved in the 
auxiliary register specified. The assembler issues a warning if this condition 
is specified by the user. 

The following values of ARn and ARm are valid: 

ARn,0 ^ n ^ 7 
ARm,0 ^ m ^ 7 

The notation "modm" or "modn" indicates the modification field goes with the 
ARm or ARn field respectively. Refer to Table 6-2 for further information. 

In indirect addressing of the three-operand addressing mode, displacements 
(if used) are allowed of or 1, and the index registers (IRO and IR1) can be 
used. The displacement of 1 is implied and is not explicitly coded in the in- 
struction word. 



31 29 28 




23 22 21 


20 




1615 1312 11 


10 87 


54 32 


1 


operation 





dst 





srcl 








src2 


1 


operation 


1 


dst 


modn 


ARn 








src2 


1 


operation 


1 


dst 





srcl 


modn 


ARn 


1 


operation 


1 1 


dst 


modn 


ARn 


modm 


ARm 



SRC1 



SRC2 



Figure 6-3. Encoding for Three-Operand Addressing Modes 
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6.2.3 Parallel Addressing Modes 



Instructions that use parallel addressing (indicated by || (two vertical bars)) 
allow for the greatest amount of parallelism possible. The destination oper- 
ands are indicated as d1 and d2, signifying cfst^ and dst2, respectively (see 
Figure 6-4). The source operands, signified by src^ and src2, use the ex- 
tended-precision registers. The parallel operation to be performed is notated 
as 'operation'. 



31 30 29 



26 25 24 23 22 21 19 18 



1615 



11 10 



87 



32 



1 


operation 


P 


dl 


d2 


srcl 


src2 


modn 


ARn 


modm 


ARm 



src3 



srcA 



Figure 6-4. Encoding for Parallel Addressing Modes 



The parallel addressing mode (P) field specifies how the operands are to be 
used, i.e., whether they are source or destination. The specific relationship 
between the P field and the operands is detailed in the description of the in- 
dividual parallel instructions (see Section 11). However, the operands are al- 
ways encoded in the same way. Bits 31 and 30 are set to the value of 10, 
indicating parallel addressing mode instructions. Bits 25 and 24 specify the 
parallel addressing mode (P) field, which defines how bits 21 -0 are to be in- 
terpreted for addressing the src operands. Bits 21 -19 are used to define the 
src^ address, bits 1 8-1 6 to define the srcl address, bits 1 5-8 the srcZ address, 
and bits 7-0 the srcA address. The notation 'modn' and 'modm' indicate 
which modification field goes with which ARn or ARm (auxiliary register) 
field, respectively. The parallel addressing operands are listed below. 

srcl < srcl <, 7 (extended-precision registers R0-R7) 

src2 <, src2 ^ 7 (extended-precision registers R0-R7) 

dl If 0, (/sf / is RO. If 1 , (/sf / is R1 . 

d2 If 0, dst2 is R2. If 1 , dst2 is R3. 

P ^ P < 3 

src3 indirect (disp = 0, 1, I RO, I R1) 

src4 indirect (disp = 0, 1, I RO, I R1) 

As in the three-operand addressing mode, indirect addressing in the parallel 
addressing mode allows for displacements of or 1 and the use of the index 
registers (IRO and IR1 ). The displacement of 1 is implied and is not explicitly 
coded in the instruction word. 

In the encoding shown for this mode in Figure 6-4, if the src3 and src4 fields 
use the same auxiliary register, both addresses are correctly generated, but 
only the value created by the src3 field is saved in the auxiliary register speci- 
fied. The assembler issues a warning if this condition is specified by the user. 



6-20 



Addressing - Groups of Addressing Modes 



6.2.4 Long-Immediate Addressing Mode 

The long-immediate addressing mode is used to encode the program control 
instructions (BR, BRD, and CALL), for which it is useful to have a 24-bit ab- 
solute address contained in the instruction word. The unconditional branches, 
BR (standard) and BRD (delayed), use the long-immediate addressing mode. 
Bits 31-25 are set to the value of 0110000, indicating long-immediate ad- 
dressing mode instructions. Selection of bit 24 determines the type of branch: 
D = for a standard branch or D = 1 for a delayed branch. The long-immed- 
iate operand is the 24-bit src. These instructions are encoded as shown in 
Figure 6-5. 



31 



25 24 23 



110 I'd]" 



Figure 6-5. Encoding for Long-Immediate Addressing Mode 



6.2.5 Conditional-Branch Addressing Modes 

Instructions using the conditional-branch addressing modes {Bcond, 
BcondD, CALLcond, DBcond, and DBcondD) can perform a variety of con- 
ditional operations. Bits 31 -27 are set to the value of 01 1 01 , indicating con- 
ditional-branch addressing mode instructions. Bit 26 is set to or 1, the 
former selects DBcond, the latter Bcond. Selection of bit 25 determines the 
conditional-branch addressing mode (B). If B == 0, register addressing is 
used; if B = 1, PC-relative addressing is used. Selection of bit 21 sets the type 
of branch: D = for a standard branch or D = 1 for a delayed branch. The 
condition field {cond) specifies the condition checked to determine what ac- 
tion to take, i.e., whether or not to branch (see Section 11 for a list of condi- 
tion codes). Figure 6-6 shows the encoding for conditional-branch 
addressing. 




DBcond(D): 

31 27 26 25 24 22 21 20 




1615 54 





1 1 1 1 1 1 B 1 ARn | d| 


cond 


|00000000000| src reg. 


1 


Bcond(D): 1 1 


|o 1 1 i|i|b|o o|d| 


cond 


1 immediate 


_ 



Figure 6-6. Encoding for Conditional-Branch Addressing Modes 
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6.3 Circular Addressing 



Many algorithms, such as convolution and correlation, require the implemen- 
tation of a circular buffer in memory. In convolution and correlation, the cir- 
cular buffer is used to implement a sliding window which contains the most 
recent data to be processed. As new data is brought in, the new data over- 
writes the oldest data. Key to the implemention of a circular buffer is the im- 
plementation of a circular addressing mode. This section describes the circular 
addressing mode of the TMS320C30. 

The blocksize register (BK) specifies the size of the circular buffer. Informa- 
tion concerning the lower 16 bits of the BK register plus a user-selected aux- 
iliary register (ARn) are used to specify the bottom of the circular buffer. The 
information concerning the BK register is the location of the the first 1 bit, 
counting from the most-significant bit to the least-significant bit, in the lower 
1 6 bits. With the location of the first 1 bit specified as bit N, the address at 
the top of the buffer is referred to as the effective base (EB) and is equal to 
bits 31 through (N + 1 ) of ARn with bits N through of EB being zero. 

Figure 6-7 illustrates the relationships between the blocksize register (BK), the 
auxiliary registers (ARn), the bottom of the circular buffer, the top of the cir- 
cular buffer, and the index into the circular buffer. 
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31 N + 1 N 
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H . . . H 


1(N LSBs 
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L. . . L 



CIRCULAR 

ADDRESSING 

ALGORITHM 
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0. . .0 



L' . . . L' 



LEGEND: 

ARn = auxiliary register n 
= blocksize register 
= effective base 
= high-order bits 



BK 
EF 
H 



L = low-order bits 

L' = new low-order bits 

LSB = least-significant bit 

N = bit value 



Figure 6-7. Flowchart for Circular Addressing 



In circular addressing, 'index' refers to the N LSBs of the auxiliary register se- 
lected, and 'step' Is the quantity being added to or subtracted from the auxil- 
iary register. The following two rules must be followed when using circular 
addressing: 

• The step used must be less than or equal to the blocksize. 

• The first time the circular queue is addressed, the auxiliary register must 
be pointing to an element in the circular queue. 
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The algorithm for circular addressing is as follows: 

If £ index + step < BK: 

index = index + step. 
Else if index + step >. BK: 

index = index + step - BK. 
Else if index + step < 0: 

index = index + step + BK. 

Figure 6-8 shows how the circular buffer is implemented. It illustrates the re- 
lationship of the quantities generated and the elements in the circular buffer. 
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Address 
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Data 
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.H 
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-¥ 
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31 







Last Element 




H. 


.H 


LSBs of BK 


— »• 


Last Element + 1 



Figure 6-8. Circular Buffer Implennentation 



Figure 6-9 provides an example that shows the operation of circular address- 
ing. Assuming that all ARs are four bits, let ARO = 0000,and BK = 0110 
(blocksize of 6). This example shows a sequence of modifications and the 
resulting value of ARO. It also shows how the pointer steps through the cir- 
cular queue, with a variety of step sizes (both incrementing and decrement- 
ing). 
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*ARO 

*AR0++(5)% 

*AR0++(2)% 

*AR0~(3)% 

*AR0++(6)% 

*ARO — % 



ARO = (0th value) 

ARO = 5 (1st value) 

ARO = 1 (2nd value) 

ARO = 4 (3rd value) 

ARO = 4 (4th value) 

ARO = 3 (5th value) 



Value 

0th -^ 
2nd -* 



5th 

4th, 3rd 

1st 



Data 


Element 


Element 1 


Element 2 


Element 3 


Element 4 


Element 5 (Last Element) 


Last Element + 1 



Address 


1 
2 
3 
4 
5 
6 



Figure 6-9. Circular Addressing Example 



Circular addressing is especially useful for the implementation of FIR filters. 
Figure 6-10 shows one possible data structure for FIR filters. Note that the 
initial value of ARO points to h(N-1), and the initial value of AR1 points to 
x(0). Circular addressing is used in the TMS320C30 code for the FIR filter 
shown in Figure 6-11. 




Impulse Response 



Input Samples 



ARO 



h(N-1) 



h(N-2) 



h(2) 



h(1) 



h(0) 
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Figure 6-10. Data Structure for FIR Filters 
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* Initialization 



TOP 



LDI 


N,BK 


LDI 


H,ARO 


LDI 


X,AR1 


LDF 


IN, R3 


STF 


R3,*AR1++ 


LDF 


0,R0 


LDF 


0,R2 


Filter 





Load block size. 

Load pointer to impulse response. 
Load pointer to bottom of input 
sample buffer. 

Read input sample. 

Store with other samples. 

and point to top of buffer. 

Initialize RO . 

Initialize R2 . 



Repeat next instruction. 



RPTS N-1 

MPYF3 * ARO++% , *ARl++% , RO 

ADDF3 R0,R2,R2 ; Multiply and accumulate. 

ADDF R0,R2 ; Last product accumulated. 



STF 
B 




R2,Y 
TOP 



Save result, 
Repeat . 



Figure 6-11. FIR Filter Code Using Circular Addressing 
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6.4 Bit-Reversed Addressing 



Bit-reversed addressing on the TMS320C30 is useful in FFT algorithms using 
a variety of radices. One auxiliary register is used as a pointer to the physical 
location of a data value. IRO is used to specify the size of the FFT; e.g., the 
value contained in IRO must be equal to 2" where n is an integer. By adding 
IRO to the auxiliary register using bit-reversed addressing, addresses are gen- 
erated in a bit-reversed fashion. 

To illustrate this kind of addressing, assume eight-bit auxiliary registers. Let 
AR2 contain the value 0110 0000 (96). This is the base address of the data 
in memory. Let IRO contain the value 0000 1000 (8). Figure 6-12 shows a 
sequence of modifications of AR2 and the resulting values of AR2. 

*AR2 

*AR2++(IR0)B 

*AR2++(IR0)B 

*AR2++{IR0)B 

*AR2++{IR0)B 

*AR2++(IR0)B 

*AR2++(IR0)B 

*AR2++(IR0)B 

Figure 6-12. Bit-Reversed Addressing Example 
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Olio 


0010 
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Olio 
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(5th 
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AR2 


= 


Olio 


Olio 


(6th 


value) 


AR2 
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0110 


1110 


(7th 


value) 




Table 6-3 shows the relationship of the index steps and the four LSBs of AR2. 
It can be seen that the four LSBs can be found by reversing the bit pattern of 
the steps. 

Table 6-3. Index Steps and Bit-Reversed Addressing 



STEP 


BIT PATTERN 


BIT-REVERSED PATTERN 


BIT-REVERSED STEP 





GOOD 


0000 





1 


0001 


1000 


8 


2 


0010 


0100 


4 


3 


0011 


1100 


12 


4 


0100 


0010 


2 


5 


0101 


1010 


10 


6 


0110 


0110 


6 


7 


0111 


1110 


14 
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6.5 System and User Stack Management 

The TMS320C30 provides a dedicated system stack pointer (SP) for building 
stacks in memory. The auxiliary registers can also be used to build a variety 
of more general linear lists. This section discusses the implementation of the 
following types of linear lists: 




Stack 



A linear list for which al 
end of the list. 



insertions and deletions are made at one 



Queue A linear list for which all insertions are made at one end of the list, 
and all deletions are made at the other end. 

Deque A 'double-ended queue' linear list for which insertions and deletions 
are made at the either end of the list. 

The system stack pointer (SP) is a 32-bit register that contains the address of 
the top of the system stack. The system stack fills from low-memory address 
to high-memory address (see Figure 6-13). The SP always points to the last 
element pushed onto the stack. A push performs a preincrement; and a pop, 
a postdecrement of the system stack pointer. 

The program counter is pushed on the system stack on subroutine calls, traps, 
and interrupts. It is popped from the system stack on returns. The system stack 
can be pushed and popped using the PUSH, POP, PUSHF, and POPF in- 
structions. 

LOW MEMORY 



SP 



BOTTOM OF STACK 



TOP OF STACK 



(FREE) 



HIGH MEMORY 
Figure 6-13. System Stack Configuration 



6.5.1 Stacks 



Stacks can be built from low to high memory or high to low memory. Two 
cases for each type of stack are shown. Stacks can be built using the 
preincrement/decrement and postincrement/decrement modes of modifying 
the auxiliary registers (AR). Stack growth from high-to-low memory can be 
implemented in two ways: 

CASE 1 : Stores to memory using *--ARn to push data on the stack and reads 
from memory using *ARn + -i- to pop data off the stack. 

CASE 2: Stores to memory using *ARn-- to push data on the stack and reads 
from memory using * ++ARn to pop data off the stack. 
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Figure 6-14 illustrates these two cases. The only difference is that in using 
case 1, the AR always points to the top of the stack, and in case 2, the AR 
always points to the next free location on the stack. 



CASE 1 
LOW MEMORY 



CASE 2 
LOW MEMORY 



ARn 



(FREE) 



TOP OF STACK 



BOTTOM OF STACK 



ARn 



(FREE) 



TOP OF STACK 



BOTTOM OF STACK 



HIGH MEMORY HIGH MEMORY 

Figure 6-14. tmplementions of High-to-Low Memory Staclis 

Stack growth from low-to-high memory can be implemented in two ways: 

CASE 3: Stores to memory using * + +ARn to push data on the stack and 
reads from memory using *ARn-- to pop data off the stack. 

CASE 4: Stores to memory using *ARn + + to push data on the stack and 
reads from memory using *--ARn to pop data off the stack. 

Figure 6-15 shows these two cases. In the case 3, the AR always points to 
the top of the stack. In case 4, the AR always points to the next free location 
on the stack. 



CASE 3 
LOW MEMORY 



CASE 4 
LOW MEMORY 



ARn- 



BOTTOM OF STACK 



TOP OF STACK 



(FREE) 



ARn 



BOTTOM OF STACK 



TOP OF STACK 



(FREE) 



HIGH MEMORY HIGH MEMORY 

Figure 6-15. implementions of Low-to-High Memory Stacks 
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6.5.2 Queues and Deques 



The implementations of queues and deques is based upon tlie manipulation 
of the auxiliary registers for user stacks. For queues, two auxiliary registers 
are used, one to mark the front of the queue from which data is popped and 
the other to mark the rear of the queue where data is pushed. 

For deques, two auxiliary registers are also necessary. One is used to mark 
one end of the deque, and the other is used to mark the other end. Data can 
be popped or pushed from either end. 
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Section 7 

Program Flow Control 



The TMS320C30 provides a complete set of flexible and powerful constructs 
that allow for software control of the program flow. These consist of two main 
types: repeat modes and branching (standard and delayed). When program- 
ming includes a combination of repeat modes, standard branches, and delayed 
branches, the type best suited for a particular application can be selected. 

Several interlocked operations instructions provide a flexible means of multi- 
processor support. Through the use of external signals, these instructions al- 
low for powerful synchronization mechanisms. They also guarantee the 
integrity of the communication and result in a high-speed operation. 

The TMS320C30 supports a nonmaskable external reset signal and a number 
of internal and external interrupts. These functions can be programmed for a 
particular application. 

Major topics discussed in this section include: 

• Repeat Modes (Section 7.1 on page 7-2) 

- Initialization 

- Operation 

• Delayed Branches (Section 7.2 on page 7-7) 

© Interlocked Operations (Section 7.3 on page 7-8) 

• Reset Operation (Section 7.4 on page 7-12) 
® Interrupts (Section 7.5 on page 7-15) 
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7.1 Repeat Modes 



The repeat modes of the TMS320C30 allow for the implementation of zero- 
overhead looping. For many algorithms, there is an inner kernel of code where 
most of the execution time is spent. Using the repeat modes allows these 
time-critical sections of code to be executed in the shortest possible time. 

The TMS320C30 provides two instructions to support zero-overhead looping: 
RPTB (repeat a block of code) and RPTS (repeat a single instruction). RPTB 
allows for a block of code to be repeated a specified number of times. RPTS 
allows a single instruction to be repeated a number of times and reduces the 
bus traffic by fetching the instruction only once. 

Three registers (RS, RE, and RC) are associated with the updating of the 
program counter when updated in a repeat mode. Table 7-1 describes these 
registers. 

Table 7-1. Repeat Mode Registers 




REGISTER 


FUIMCTIOIM 


RS 


Repeat Start Address Register. Holds the address of the first instruction 
of the block of code to be repeated. 


RE 


Repeat End Address Register. Holds the address of the last instruction 
of the block of code to be repeated. 


RC 


Repeat Count Register. Contains one less than the number of times the 
block remains to be repeated. 



7.1.1 Repeat Mode Initialization 



There are two bits that are very important to the operation of RPTB and RPTS, 
the RM and S bits. 

The RM (repeat mode flag) bit in the status register specifies if the processor 
is running in the repeat mode. If RM = 0, fetches are not made in repeat 
mode, if RM = 1, fetches are made in repeat mode. 

The S bit is hidden from the user, but is necessary to fully describe the oper- 
ation of RPTB and RPTS. If S = 0, the CPU is not performing fetches in the 
repeat-single mode. If S = 1 and RM = 1, the CPU is performing fetches in 
the repeat-single mode. 

The correct operation of the repeat modes requires that all of the above regis- 
ters and status register fields be initialized correctly. The RPTB and RPTS in- 
structions perform this initialization in slightly different ways (see Sections 
7.1 .2 and 7.1 .3). 
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7.1.2 RPTB Initialization 

When RPTB src is executed, the following operations take place: 

1) PC + 1 - RS 

2) src -* RE 

3) ^ -* RM status register bit 

4) - S bit. 

Step 1 loads the start address of the block into RS. Step 2 loads the src into 
the RE (end address of the block). The src operand is a 24-bit value con- 
tained in the instruction word. Step 3 sets the status register to indicate the 
repeat mode of operation. Step 4 indicates that this is the repeat block mode 
of operation. 

The last bit of information required is the number of times to repeat the block. 
The value is determined by properly initializing the RC (repeat count) register. 
Since the execution of RPTB does not load the RC, this register must be 
loaded explicitly by the user. The typical setup of the block repeat operation 
is shown below. 

LDI 15, RC ; 15 -^ RC 

RPTB LOOP ; LOOP -» RE, PC + 1 -> RS, 1 -* RM, -* S 

The repeat modes repeat a block of code at least once in a typical operation. 
The repeat counter should be loaded with one less than the number of times 
to repeat the block; i.e., a value of in RC repeats the block of code one time. 
All block repeats initiated by RPTB can be interrupted. 

7.1.3 RPTS Initialization 

When RPTS src is executed, the following sequence of operations occurs: 

1) PC + 1 -* RS 

2) PC + 1 -> RE 

3) ^ -* RM status register bit 

4) 1 -* S bit 

5) src -* RC 

The RPTS instuction loads all registers and mode bits necessary for the oper- 
ation of the single instruction repeat mode. Step 1 loads the start address of 
the block into RS. Step 2 loads the end address into the RE (end address of 
the block). Since this is a repeat of a single instruction, the start address and 
the end address are the same. Step 3 sets the status register to indicate the 
repeat mode of operation. Step 4 indicates that this is the repeat single-in- 
struction mode of operation. The operand src is loaded into RC. 

Repeats of a single instruction initiated by RPTS are not interruptible, since 
the RPTS fetches the instruction word only once and then keeps it in the in- 
struction register for reuse. An interrupt would cause the instruction word to 
be lost. The refetching of the instruction word from the instruction register 
reduces memory accesses and, in effect, acts as a one-word program cache. 
If it is necessary to have a single instruction that is repeatable and interruptible, 
the RPTB instruction may be used on this single instruction. 
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7.1.4 Repeat Mode Operation 



if RM == 1 



The information in the repeat mode registers and associated control bits is 
used to control the modification of the PC when the fetches are being made 
in repeat mode. The repeat modes compare the contents of the RE register 
with the program counter (PC). If they match and the repeat counter is non- 
negative, the repeat counter is decremented, the PC is loaded with the repeat 
start address, and the processing continues. The fetches and appropriate sta- 
tus bits are modified as necessary. Note that the repeat counter (RC) is never 
modified when RM is 0. The maximum number of repeats occurs when RC = 
OSOOOOOOOh. This will result in 080000001 h repetitions. The detailed algo- 
rithm for the update of the PC is described in Figure 7-1 . 



if S == 1 



if first time through 

fetch instruction from memory 
else 

fetch instruction from IR 



If in repeat mode (RPTB or RPTS) 

If RPTS 

If this is the first fetch 

Fetch instruction from memory 

If not the first fetch 

Fetch instruction from IR 




RC- 1 -► RC 
if RC < 

-► ST(RM) 

-► S 

PC + 1 -► PC 

else if S == 

fetch instruction from memory 
if PC == RE 

RC - 1 -> RC 
if RC>0 

RS ->■ PC 
else if RC < 
-■ ST(RM) 

-►s 

PC + 1 -► PC 



; Decrement RC 

; If RC is negative 

; Repeat single mode completed 

; Turn off repeat mode bit 

; Clear S 

; Increment PC 

If RPTB 

Fetch instruction from memory 

If this is the end of the block 

Decrement RC 

If RC is not negative 

Set PC to start of block 

If RC is negative 

Turn off repeat mode bits 

Clear S 

Increment PC 



Figure 7-1. Repeat Mode Control Algorithm 



The RPTB and RPTS are four-cycle instructions. These four cycles of over- 
head are only incurred on the first pass through the loop. All subsequent 
passes through the loop are accomplished with zero cycles of overhead. In 
Example 7-1, the block of code from STLOOP to ENDLOP is repeated sixteen 
times. 



Example 7-1. RPTB Operation 



LD 
RPTB 



STLOOP 



15, RC ; Load repeat counter with 15 
ENDLOP ; Execute the block of code 

; from STLOOP to ENDLOP 16 times 



ENDLOP 
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Using this mode of modifying the PC allows for a straightforward analysis of 
what would happen in the case of branches within the block. It is best to look 
at the operation from the point of view that the next value of the PC will be 
either PC + 1 or the contents of the RS register. It is thus apparent that this 
method of block repeat allows for any amount of branching within the re- 
peated block. Execution can go anywhere within the user's code via interrupts, 
subroutine calls, etc. For proper modification of the loop counter, the last in- 
struction of the loop must be fetched. The repeating of the loop can be 
stopped prior to completion by writing a into the repeat counter or writing 
into the RM bit of the status register. 

Since the block repeat modes modify the program counter, other instructions 
cannot modify the program counter at the same time. Two rules apply here: 

1 ) The last instruction in the block (or the only instruction in a block of size 
one) cannot be a Bcond, BR, DQcond, CALL, CALLcond, TRAPcond, 
RETlcond, RETScond, IDLE, RPTB, or RPTS. Example 7-2 shows an 
incorrectly placed standard branch. 

2) None of the last four instructions from the bottom of the block (or the 
only instruction in a block of size one) can be a BcondD, BRD, or 
DBcondD. Example 7-3 shows an incorrectly placed delayed branch. 

If either of these rules are violated, the PC will be undefined. 

Example 7-2. Incorrectly Placed Standard Branch 

Load repeat counter with 15 

Execute block of code 

from STLOOP to ENDLOP 16 times 



; This branch violates rule 1 






LD 


15, RC 




RPTB 


ENDLOP 


STLOOP 


JCS 




ENDLOP 


BR 


OOPS 



Example 7-3. Incorrectly Placed Delayed Branch 





LD 


15, RC 




RPTB 


ENDLOP 


STLOOP 


GAP 






BRD 


OOPS 




ADDF 






MPYF 




ENDLOP 


SUBF 





Load repeat counter with 15 

Execute block of code 

from STLOOP to ENDLOP 16 times 



; This branch violates rule 2 



Block repeats (RPTB) are nestable. Since all of the control is defined by the 
RS, RE, RC, and ST registers, saving and restoring these registers allows for 
their nesting. The RM bit in the status register can be used to determine if the 
block repeat mode is active. For example, if an interrupt service routine is 
written which requires the use of RPTB, it is possible that the interrupt asso- 
ciated with the routine may occur during another block repeat. The interrupt 
service routine can check the RM bit. If this bit is set, the interrupt routine 
saves RS, RE, RC, and ST. The interrupt routine can then perform a block re- 
peat. Before returning to the interrupted routine, the interrupt routine restores 
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RS, RE, RC, and ST. If the RM bit is not set, the save and restore of these re- 
gisters is not necessary. 
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7.2 Delayed Branches 

The branching capabilities of the Ti\/IS320C30 include two main types: stan- 
dard and delayed branches. Standard branches empty the pipeline before 
performing the branch to guarantee correct management of the program 
counter. This results in a TMS320C30 branch taking four cycles, included in 
this class are calls, returns, and traps. 

Delayed branches on the TMS320C30 do not empty the pipeline, but rather 
guarantee that the next three instructions will be executed before the program 
counter is modified by the branch. The result is a branch that only requires a 
single cycle, thus making the speed of the delayed branch very close to the 
optimal block repeat modes of the TMS320C30. However, unlike block repeat 
modes, delayed branches may be used in situations other than looping. Every 
delayed branch has a standard branch counterpart that is used when a delayed 
branch cannot be used. The delayed branches of the TMS320C30 are 
BcondD, BRD, and DBcondD. 

Conditional delayed branches use the conditions that exist at the end of the 
instruction immediately preceding the delayed branch. They do not depend 
upon the instructions following the delayed branch. Delayed branches are 
guaranteed to allow the three following instructions to be executed regardless 
of other pipeline conflicts. 

When a delayed branch is fetched, it remains pending until the three following 
instructions are executed. None of the three instructions that follow a delayed 
branch can be Bcond, BcondD, BR, BRD, DBcond, DBcondD, CALL, 
CALLcond, TRAPcond, RET\cond, RETScond, RPTB, RPTS, or IDLE, (see 
Example 7-4). 

Delayed branches disable interrupts until the three instructions following the 
delayed branch are completed. This is independent of whether or not the 
branch is taken. 

If delayed branches are used incorrectly, the PC will be undefined. 

Example 7-4. Incorrectly Placed Delayed Branches 

Bl: BD LI 

NOP 

NOP 
B2 : B L2 ; This branch is incorrectly placed 

NOP 

MJH 

NOP 




7-7 



Program Flow Control - Interlocked Operations 



7.3 Interlocked Operations 



One of the most common multiprocessing configurations is the sharing of 
global memory by multiple processors. In order to allow multiple processors 
to access this global memory and share data in a coherent manner, some sort 
of arbitration or handshaking is necessary. This requirement for arbitration is 
the purpose of the TMS320C30 interlocked operations. 

The TMS320C30 provides a flexible means of multiprocessor support with five 
instructions, referred to as interlocked operations. Through the use of external 
signals, these instructions allow for powerful synchronization mechanisms. 
They also guarantee the integrity of the communication and result in a high- 
speed operation. The interlocked-operation instruction group is listed in Table 
7-2. 

Table 7-2. Interlocked Operations 



MNEMONIC 


DESCRIPTION 


OPERATION 


LDFI 


Load floating-point value into a register, 
interlocked 


Signal interlocked 

src -* dst 


LDII 


Load integer into a register, interlocked 


Signal interlocked 
src -* dst 


SIGI 


Signal, interlocked 


Signal interlocked 
Clear interlock 


STFI 


Store floating-point value to memory, 
interlocked 


src -* dst 
Clear interlock 


STIl 


Store integer to memory, interlocked 


src -♦dst 
Clear interlock 



The interlocked operations use the two external flag pins, XFO and XF1 . XFO 
must be configured as an output pin, and XF1 as an input pin. When config- 
ured in this manner, XFO signals an interlock operation request, and XF1 acts 
as an acknowledge signal for the requested interlocked operation. In this 
mode, XFO and XF1 are treated as active-low signals. 

The external timing for the interlocked loads and stores are the same as stan- 
dard load and stores. The interlocked loads and stores m ay b e extend ed like 
standard accesses, by using the appropriate ready signal (RDY or XRDY). 

The LDFI and LDII instructions perform the following actions: 

1 ) Simultaneously set XFO to and begin a read cycle. The timing of XFO 
is similar to that of the address bus during a read cycle. 

2) Execute an LDF or LDI inst ruc tion a nd extend the read cycle until XF1 
is set to and a ready (RDY or XRDY) is signalled. 

3) Leave XFO set to and end the read cycle. 

The read/write operation is identical to any other read/write cycle except for 
the special use of XFO and XF1 . The src operand for LDFI and LDII is always 
a direct or ind irect memory ad dress. X FO is set to only if the src is located 
off-chip; i.e., (STRB, MSTRB or lOSTRB is active), or the src is one of the on- 
chip peripherals. If on-chip memory is accessed, then XFO is not asserted, and 
the operation is as an LDF or LDI from internal memory. 
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The STFI and STII instructions perform the following operations: 

1 ) Simultaneously set XFO to 1 and begin a write cycle. The timing of XFO 
is similar to that of the address bus during a write cycle. 

2) Execut e an S TF or STI instruction and extend the write cycle until a 
ready (RDY or XRDY) is signalled. 

As in the case for L DFI and LDII , th e dst of STFI and STII affects XFO if dst 
is located off-chip (STRB, MSTRB, or lOSTRB is active), or the src is one of the 
on-chip peripherals. If on-chip memory is accessed, then XFO is not asserted 
and the operations are as an STF or STI to internal memory. 

The SIGI instruction functions as follows: 

1 ) Sets XFO to 0. 

2) Idles until XF1 is set to 0. 

3) Sets XFO to 1 and ends the operation. 

While the LDFI, LDII, and SIGI instructions are waiting for XF1 to be set to 
0, they may be interrupted. LDFI and LDII require a ready signal in order to 
be interrupted. This allows the user to implement protection mechanisms 
against deadlock conditions by interrupting an interlocked load that has taken 
too long. Upon return from the interrupt, the next instruction is executed. The 
STFI and STII instructions are not interruptible. 

Interlocked operations can be used to implement a busy-waiting loop, to ma- 
nipulate a multiprocessor counter, to implement a simple semaphore mech- 
anism, or to perform synchronization between two TMS320C30s. The 
following examples illustrate the usefulness of the interlocked operations in- 
structions. Example 7-5 shows the implementation of a busy-waiting loop. If 
location LOCK is the interlock for a critical section of code, and a nonzero 
means the lock is busy, the algorithm for a busy-waiting loop can be used as 
shown in Example 7-5. 




Example 7-5. Busy-Waiting Loop 

LDI: 1,R0 
LI: LDII @L0CK,R1 

STII R0,@LOCK 

BNZ LI 



Put 1 in RO 

Interlocked operation begun 

Contents of LOCK -♦ Rl 

Put RO (= 1) into LOCK, XFO = 

Interlocked operation ended 

Keep trying until LOCK = 



Example 7-6 shows how a location COUNT may contain a count of the 
number of times a particular operation needs to be performed. This operation 
may be performed by any processor in the system. If the count is zero, the 
processor waits until it is nonzero before beginning processing. The algorithm 
for modifying COUNT correctly is shown in Example 7-6. 
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Example 7-6. Multiprocessor Counter Manipulation 

4,I0F 
§C0UNT,R1 



CT: 



OR 



LDII 



SUBI 


1,R1 


STII 


R1,@C0UNT 



XFO = 1 

Interlocked operation ended 
Interlocked operation begun 
Contents of COUNT -♦ Rl 
If COUNT = 0, keep trying 
Decrement Rl(= COUNT) 
Update COUNT, XFO = 1 
Interlocked operation ended 



Figure 7-2 illustrates multiple TI\/IS320C30s sharing global memory and using 
the interlocked instructions as in Examples 7-7, 7-8, 7-9, and 7-10. 



Global 
Memory 







Arbitration 
Logic 



Lock, Count, or 8 



XFO XF1 



TMS320C30 



(X)A 



(XID 



CTRL 

— •— 



Local 
Memory 



(X)A 



(XID 



CTRL 



XFO XF1 



TMS320C30 

ff2 



Local 
Memory 



Figure 7-2. Multiple TMS320C30s Sharing Global Memory 



Sometimes it may be necessary for several processors to access some shared 
data or other common resources. The portion of code which must access the 
shared data is called a critical section. 

To ease the programming of critical sections, semaphores may be used. Se- 
maphores are variables which can only take non-negative integer values. Two 
primitive, indivisible, operations are defined on semaphores, namely (with S 
being a semaphore): 
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V(S) 
P(S) 



S + 1 ^ s 

if (S == 0) , go to P 

else S - 1 ~* S 



Indivisibility of V(S) and P(S) means that when these processes access and 
modify the semaphore S, they are the only processes accessing and modifying 
S. 

To enter a critical section, a P operation is performed on a common sema- 
phore, say S (S is initialized to 1). The first processor performing P(S) will 
be able to enter its critical section. All other processors are blocked since S 
has become 0. After leaving its critical section, the processor performs a V(S), 
thus allowing another processor to execute P(S) successfully. 

The TMS320C30 code for V(S) is shown in Example 7-7, and code for P(S) 
is shown in Example 7-8. Compare the code in Example 7-8 to the code in 
Example 7-6. 

Example 7-7. Implementation of V(S) 

LDII @S,RO 



V: 



; Interlocked read of S begins (XFO = 0) 

; Contents of S -♦ RO 

ADDI 1,R0 ; Increment RO (= S) 

STII RO,(as ; Update S, end interlock (XFO = 0) 



Example 7-8. implementation of P(S) 




P: OR 4,I0F ; End interlock (XFO = 1) 

LDII (as,RO ; Interlocked read of S begins 

; Contents of S -* RO 

BZ P ; If S = 0, go to P and try again 

SUBI 1,R0 ; Decrement RO (= S) 

STII RO,@S ; Update S, end interlock (XFO = 1) 

The SIGI operation may be used to synchronize, at an instruction level, multi- 
ple TMS320C30S. Consider two processors connected as shown in Figure 
7-3 The code for the two processors is shown in Example 7-9 



TMS320C30 #1 




TMS320C30 #2 


XFO 
XF1 




XF1 
XFO 







Figure 7-3. Zero-Logic Interconnect of TMS320C30s 
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Processor #1 runs until it executes the SIGI. It then waits until processor #2 
executes a SIGI. At this point, the two processors have synchronized and 
continue execution. 

Example 7-9. Code to Synchronize Two TMS320C30s at the Software Level 



Time 




Code for TMS320C30 #1 



SIGI 



Code for TMS320C30 #2 



(WAIT) 



Synchronization Occurs 



SIGI 
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7.4 Reset Operation 



The TMS320C30 supports a nonmaskable external reset signal (RESET), which 
is used to perform system reset. This section discusses the reset operation. 



At powerup, the state of the TMS320C30 processor is undefined. The RESET 
signal is used to place the processor in a known state. This signal must be 
asserted low for 10 or more HI clock cycles to guarantee a system reset. HI 
is an output clock signal generated by the TMS320C30 (see Appendix A for 
more information). 

Reset affects the other pins on the device in either a synchronous or asyn- 
chronous manner. The synchronous reset is gated by the TMS320C30s in- 
terna! clocks. The asynchronous reset directly affects the pins, and is faster 
than the s ynchro nous reset. Table 7-3 shows the state of the TMS320C30s 
pins after RESET = 0. Each pin is described according to whether the pin is 
reset synchronously or asynchronously. 



Table 7-3. Pin Operation at Reset 



SIGNAL 


#PINS 


OPERATION AT RESET 


PRIMARY INTERFACE (61 PINS) 


D(31-0) 


32 


Synchronous reset. Placed in high-impedance state. 


A(23-0) 


24 


Synchronous reset. Placed in high-impedance state. 


R/W 




Synchronous reset. Deasserted by going to a high level. 


STRB 




Synchronous reset. Deasserted by going to a high level. 


RDY 




Reset has no effect. 


HOLD 




Reset has no effect. 


HOLDA 




Reset has no effect. 


EXPANSION INTERFACE (49 PINS) 


XD(31-0) 


32 


Synchronous reset. Placed in high-impedance state. 


XA(12-0) 


13 


Synchronous reset. Placed in high-impedance state. 


XR/W 


1 


Synchronous reset. Deasserted by going to a high level. 


MSTRB 


1 


Synchronous reset. Deasserted by going to a high level. 


lOSTRB 


1 


Synchronous reset. Deasserted by going to a high level. 


XRDY 


1 


Reset has no effect. 


CONTROL SIGNALS (9 PINS) 


RESET 


1 


Reset input pin 


INT(3-0) 


4 


Reset has no effect. 


lACK 


1 


Synchronous reset. Deasserted by going to a high level. 


MC/MP 


1 


Reset has no effect. 


XF(1-0) 


2 


Asynchronous reset. Placed in high-impedance state. 
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Table 7-3. Pin Operation at Reset (Continued) 



SIGNAL 


# PINS 


OPERATION AT RESET 


SERIAL PORT SIGNALS (6 PINS) 








CLKXO 




Asynchronous reset. Placed in high-impedance state. 


DXO 




Asynchronous reset. Placed in high-impedance state. 


FSXO 




Asynchronous reset. Placed in high- impedance state. 


CLKRO 




Asynchronous reset. Placed in high-impedance state. 


DRO 




Asynchronous reset. Placed in high-impedance state. 


FSRO 




Asynchronous reset. Placed in high-impedance state. 


SERIAL PORT 1 SIGNALS (6 PINS) 


CLKX1 




Asynchronous reset. Placed in high-impedance state. 


DX1 




Asynchronous reset. Placed in high-impedance state. 


FSX1 




Asynchronous reset. Placed in high-impedance state. 


CLKR1 




Asynchronous reset. Placed in high-impedance state. 


DR1 




Asynchronous reset. Placed in high-impedance state. 


FSR1 




Asynchronous reset. Placed in high-impedance state. 


TIMER SIGNAL (1 PIN) 


TCLKO 


1 


Asynchronous reset. Placed in high-impedance state. 


TIMER 1 SIGNAL (1 PIN) 


TCLKl 


1 


Asynchronous reset. Placed in high-impedance state. 


SUPPLY and OSCILLATOR SIGNALS (29 PINS) 


Vdd{3-0) 


4 


Reset has no effect. 


IODVdd(1-0 


2 


Reset has no effect. 


ADVdd(I.O) 


2 


Reset has no effect. 


PDVdd 


1 


Reset has no effect. 


DDVdd(I.O) 


2 


Reset has no effect. 


MDVdd 


1 


Reset has no effect. 


Vss(3-0) 


4 


Reset has no effect. 


DVss(3-0) 


4 


Reset has no effect. 


CVss(1<0) 


2 


Reset has no effect. 


ivss 




Reset has no effect. 


VbBP 




Reset has no effect. 


SUBS 




Reset has no effect. 


XI 




Reset has no effect. 


X2/CLKIN 




Reset has no effect. 


HI 




Synchronous reset. Will go to its initial state when RESET 
makes a 1 to transition. See Appendix A. 


H3 


1 


Synchronous reset. Will go to its initial state when RESET 
makes a 1 to transition. See Appendix A. 
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Table 7-3. Pin Operation at Reset (Concluded) 



SIGNAL 


# PIIMS 


OPERATION AT RESET 


EMULATION, TEST, and RESERVED (18 PINS) 


EMUO 


F14 


Undefined. 


EMU1 


El 5 


Undefined. 


EMU2 


F13 


Undefined. 


EMU3 


E14 


Undefined. 


EMU4 


F12 


Undefined. 


EMU5 


CI 


Undefined. 


EMU6 


M6 


Undefined. 


RSVO 


J3 


Undefined. 


RSV1 


J4 


Undefined. 


RSV2 


K1 


Undefined. 


RSV3 


K2 


Undefined. 


RSV4 


LI 


Undefined. 


RSV5 


K3 


Undefined. 


RSV6 


L2 


Undefined. 


RSV7 


K4 


Undefined. 


RSV8 


Ml 


Undefined. 


RSV9 


L3 


Undefined. 


RSV10 


M2 


Undefined. 




At system reset, the following additional operations are performed: 

O The peripherals are reset. This is a synchronous operation. The periph- 
eral reset is described in Section 9. 

© The following CPU registers are loaded with zero: 

ST (CPU status register) 

IE (CPU/DMA interrupt enable flags) 

IF (CPU interrupt flags) 

lOF (I/O flags) 

O The reset vector is read from memory location Oh and loaded into the 
PC. This vector contains the start address of the system reset routine 

o Execution begins. Refer to Section 12 an example of a processor in- 
itialization routine. 

Multiple TMS320C30S driven by the same s ystem clock may be reset and 
synchronized. When the 1 to transition of RESET occurs, the processor is 
placed on a well-defined internal phase, and all of the TMS320C30s will come 
up on the same internal phase. 
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7.5 Interrupts 



IFTfn — D Q 




CLK 



HI 



The TMS320C30 supports multiple Internal and external interrupts, which can 
be used for a variety of applications. This section discusses the operation of 
these interrupts. 

A functional diagram of the logic used to implement the external interrupt in- 
puts is shown in Figure 7-4; the logic for internal interrupts is similar. Addi- 
tional information regarding internal interrupts can be found in Section 9. 
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Figure 7-4. Interrupt Logic Functional Diagram 



External interrupts are synchronized internally as illustrated by the three flip- 
flops clocked by HI and H3. Once synchronized, the interrupt input will set 
the corresponding Interrupt Flag register (IF) bit if the interrupt is active. 

External interrupts can be effectively either edge- or level-triggered, depending 
on the duration of the low level on the interrupt input. An external interrupt 
must be held low for at least one H1/H3 cycle to be recognized by the 
TMS320C30. If the interrupt is held low for between one and three cycles, 
then only one interrupt is recognized. If the interrupt is held low three or more 
cycles, more than one interrupt may be recognized depending on how rapidly 
interrupts are serviced. 

When a particular interrupt is processed by the CPU or DMA controller, the 
corresponding interrupt flag bit is cleared by the internal interrupt acknowl- 
edge signal. It should be noted, however, that if INTn is still low when the 
interrupt acknowledge signal occurs, t he in terrupt flag bit will only be cleared 
for one cycle and then set again since INTn is still low. Accordingly, it is the- 
oretically possible that , dep ending on when the IF register is read, this bit may 
be zero even though INTn is zero. When the TMS320C30 is reset, zero is 
written to the interrupt flag register, thereby clearing all pending interrupts. 
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The interrupt flag register bits may be read and written under software control. 
Writing a 1 to a IF register bit sets the associated interrupt flag to 1 . Similiarly, 
writing a resets the corresponding interrupt flag to 0. In this way, all inter- 
rupts may be triggered and/or cleared through software. Since the interrupts 
flags may be read, the interrupt pins may be polled in software when an in- 
terrupt-driven interface is not required. 

Internal interrupts operate in a similar manner. The bit in the IF register corre- 
sponding to an internal interrupt may be read and written through software. 
Writing a 1 sets the interrupt latch, and writing a clears it. All internal in- 
terrupts are one H1/H3 cycle in length. 

The CPU global interrupt enable bit (GIE), located in the CPU status register 
(ST), controls all CPU interrupts. All DMA interrupts are controlled by the 
DMA global interrupt enable bit, which is not dependent upon ST(GIE) and 
is local to the DMA. The DMA global interrupt enable bit is dependent, in 
part, upon the state of the DMA SYNCH bits. It is not directly accessible 
through software (see Section 9). The AND of the interrupt flag bit and the 
interrupt enables is then connected to the interrupt processor. 

To provide for maximum performance in servicing interrupts, the interrupt ac- 
knowledge (lACK) instruction is provided. lACK drives the lACK pin and 
performs a dummy read. The read is peformed from the address specified by 
the JACK instruction operand. When lACK is used, it typically is placed in the 
early portion of an interrupt service routine. For certain applications, it may 
be better suited at the end of the interrupt service routine or be totally unnec- 
essary. 

The CPU controls all prioritization of interrupts (see Table 7-4 for reset and 
interrupt vector locations and priorities). If the DMA is not using interrupts 
for synchronization of transfers, it will not be affected by the processing of the 
CPU interrupts. If the CPU is involved in a pipeline conflict (branch, register, 
or memory), it will not respond to the interrupts until that conflict is resolved. 
It is therefore possible to interrupt the CPU and DMA simultaneously with the 
same or different interrupts and, in effect, synchronize their activities. For ex- 
ample, it may be necessary to cause a high-priority DMA transfer that avoids 
bus conflicts with the CPU, i.e., make the DMA higher priority than the CPU. 
This may be accomplished using an interrupt that causes the CPU to trap to 
an interrupt routine that contains an IDLE instruction. Then if the same in- 
terrupt is used to synchronize DMA transfers, the DMA transfer counter can 
be used to generate an interrupt, and thus return control to the CPU following 
the DMA transfer. 

Since the DMA and CPU share the same set of interrupt flags, the DMA may 
clear an interrupt flag before the CPU can respond to it. For example, if the 
CPU interrupts are disabled, the DMA can be responding to interrupts and 
thus clearing the associated interrupt flags. 
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Table 7-4. Reset and Interrupt Vector Locations 



RESET OR 
INTERRUPT 


VECTOR 
LOCATION 


PRIORITY 


FUNCTION 


RESET 


Oh 





External reset signal input on the RESET 
pin. 


INTO 


1h 


1 


External interrupt input on the INTO pin. 


INT1 


2h 


2 


External interrupt input on the INT1 pin. 


INT2 


3h 


3 


External interrupt input on the INT2 pin. 


INT3 


4h 


4 


External interrupt input on the INT3 pin. 


XINTO 


5h 


5 


Internal interrupt generated when serial 
port transmit buffer is empty. 


RINTO 


6h 


6 


Internal interrupt generated when serial 
port receive buffer is full. 


XINT1 


7h 


7 


Internal interrupt generated when serial 
port 1 transmit buffer is empty. 


RINT1 


8h 


8 


Internal interrupt generated when serial 
port 1 receive buffer is full. 


TINTO 


9h 


9 


Internal interrupt generated by timer 0. 


TINT1 


OAh 


10 


Internal interrupt generated by timer 1. 


DINT 


OBh 


11 


Internal interrupt generated by DMA con- 
troller 0. 



If there is a delayed branch in the pipeline, interrupts are held pending until 
after the branch. If the interrupt occurs in the first cycle of the fetch of an in- 
struction, the fetched instruction is discarded (not executed), and the address 
of that instruction is pushed to the top of the system stack. If the interrupt 
occurs after the first cycle of the fetch, in the case of a multicycle fetch due to 
wait states, that instruction is executed and the address of the next instruction 
to be fetched is pushed to the top of the system stack. If no program fetch is 
occurring, then no new fetch is performed. After the address of the appropri- 
ate instruction has been pushed, the interrupt vector is fetched, loaded into the 
PC, and execution continues. 

The TMS320C30 allows the CPU and DMA to respond to and process inter- 
rupts in parallel. Figure 7-5 shows interrupt processing flow. The interrupts 
are polled and the CPU and DMA begin processing them. In the interrupt flow 
pertaining to the the CPU, the interrupt flag corresponding to the highest- 
priority enabled interrupt is cleared, and GIE is set to 0. The CPU completes 
all fetched instructions. The interrupt vector is fetched and loaded into the PC, 
and the CPU continues execution. The DMA cycle is similar to that for the 
CPU. After the pertinent interrupt flag is cleared, the DMA proceeds based 
upon the status of the SYNCH bits in the DMA global control register. 
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NO 




IF ENABLED 

INTERRUPT IS 

A CPU INTERRUPT 



CLEAR INTERRUPT FLAG 
CPU GIE-* — 



IF ENABLED 

INTERRUPT IS 

A DMA INTERRUPT 



CLEAR INTERRUPT FLAG 



COMPLETE ALL 
FETCHED INSTRUCTIONS 



PC -»•*( + +SP) 
FETCH INTERRUPT VECTOR 



CPU CONTINUES 



DMA PROCEEDS BASED 
UPON SYNCH BITS 



DMA CONTINUES 




Figure 7-5. Interrupt Processing 
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Section 8 

External Bus Operation 



Two external interfaces are provided on the TI\/IS320C30: the primary bus and 
the expansion bus. These are used to access memories and external peripheral 
devices. Software controlled wait states and an external input signal provide 
for wait state generation. 

Major topics discussed in this hardware interface section are listed below. 

• External Interface Control Registers (Section 8.1 on page 8-2) 

- Primary bus 

- Expansion bus 

O External Interface Timing (Section 8.2 on page 8-5) 
© Programmable Wait States (Section 8.3 on page 8-18) 

• Programmable Bank Switching (Section 8.4 on page 8-20) 
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8.1 External Interface Control Registers 

The TMS320C30 provides two external interfaces: the primary bus and the 
expansion bus. The primary bus consists of a 32-bit data bus, a 24-bit address 
bus, and a set of control signals. The expansion bus consists of a 32-bit data 
bus, a 13-bit address bus and a set of control signals. Both buses support 
software-controlled wait states and an external ready input signal. Both buses 
support data, program, and I/O accesses. 



When a primary bus access is performed, STRB is low. The expansion bus 
supports two types of accesses. One is used primarily for memory accesses 
that are signalled by MSTRB low. The timing for a MSTRB access is the same 
as that of the STRB access on the parallel interface. The other type of expan- 
sion bus access is commonly used for access of external peripheral devices 
and is signalled by lOSTRB low. 

The primary bus and the expansion bus each have an associated control reg- 
ister. These registers are memory-mapped as shown in Figure 8-1 . 



REGISTER 


PERIPHERAL 
ADDRESS 


EXPANSION BUS CONTROL 


808060h 


RESERVED 


808061 h 


RESERVED 


808062h 


RESERVED 


808063h 


PRIMARY BUS CONTROL 


808064h 


RESERVED 


808065h 


RESERVED 


808066h 


RESERVED 


808067h 


RESERVED 


808068h 


RESERVED 


808069h 


RESERVED 


80806Ah 


RESERVED 


80806Bh 


RESERVED 


80806Ch 


RESERVED 


80806Dh 


RESERVED 


80806Eh 


RESERVED 


80806Fh 



Figure 8-1. Memory-Mapped External Interface Control Registers 
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8.1.1 Primary Bus Control Register 

The primary bus control register is a 32-bit register that contains the control 
bits for the primary bus (see Figure 8-2). Table 8-1 lists the register bits with 
the bit names and functions. 



31 


30 


29 


28 


27 


26 


25 


24 


23 


22 


21 


20 


19 


18 


17 


16 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


15 


14 


13 


12 


11 10 9 


8 


7 6 5 


4 3 


2 


1 





XX 


XX 


XX 


BNKCMP 


WTCNT 


SWW 


HIZ 


NOHOLD 


HOLDST 



R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W 



NOTE: 



XX = reserved bit, read as 0. 
R = read, W = write. 



Figure 8-2. Primary Bus Control Register 



Table 8-1. Primary Bus Control Register Bits Summary 



BIT 


NAME 


FUNCTIOIM 





HOLDST 


Hold status bit. This bit signals if the port is being held 
(HOLDST = 1) or is not being held (HOLDST = 0). This sta- 
tus bit is valid whether the port has been held via hardware or 
software. 


1 


NOHOLD 


Port hold signal. NOHOLD allows or disallows the port to be 
held by an external HOLD signal. When NOHOLD = 1, the 
TMS320C30 takes over the external bus and controls it re- 
gardless of requests by external devices. No hold acknowledge 
(HOLDA) is asserted when a HOLD is received. However, it 
is asserted if an internal hold is generated (HIZ = 1 ). NOHOLD 
is set to at reset. 


2 


HIZ 


Internal hold. When set (HIZ = 1), the port is put in hold 
mode. This equivalent to the external HOLD signal. By forcing 
a three-state condition, the TMS320C30 can relinquish the 
external memory port through software. HOLDA goes low 
when the port is three-stated. HIZ is set to at reset. 


3-4 


SWW 


Software wait-state generation. In conjunction with WTCNT, 
this 2-bit field defines the mode of wait-state generation. It is 
set to 1 1 at reset. 


5-7 


WTCNT 


Software wait mode. This 3-bit field specifies the number of 
cycles to use when in software wait mode for the generation 
of internal wait states. The range is zero (WTCNT = 0) to 
seven (WTCNT = 1 1 1) H1/H3 cycles. It is set to 1 1 1 at 
reset. 


8-12 


BNKCMP 


Bank compare. This 5-bit field specifies the number of MSBs 
of the address to be used to define the bank size. It is set to 1 
at reset. 


13-31 


Reserved 


Read as 0. 
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8.1.2 Expansion Bus Control Register 

The expansion bus control register is a 32-bit register that contains control 
bits for the expansion bus (see Figure 8-3 and Table 8-2). 



31 


30 


29 


28 


27 


26 


25 


24 


23 22 21 


20 19 


18 


17 


16 


1 XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 1 XX 1 XX 


XX 1 XX 


XX 


XX 


XX 1 


15 


14 


13 


12 


11 


10 


g 


8 


7 6 5 


4 3 


2 


1 





1 XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


WTCNT 


SWW 


XX 


XX 


XX 1 



R/W R/W R/W R/W R/W 



NOTE: 



XX = reserved bit, read as 0. 
R = read, W = write. 



Figure 8-3. Expansion Bus Control Register 



Table 8-2. Expansion Bus Control Register Bits Summary 



BIT 


NAME 


FUNCTION 


0-2 


Reserved 


Read as 0. 


3-4 


SWW 


Software wait-state generation. In conjunction with the 
WTCNT, this 2-bit field defines the mode of wait-state gener- 
ation. It is set to 1 1 at reset. 


5-7 


WTCNT 


Software wait mode. This 3-bit field specifies the number of 
cycles to use when in software wait mode for the generation 
of internal wait states. The range is zero (WTCNT = 0) to 
seven (WTCNT = 111) H1/H3 clock cycles. It is set to 1 1 
1 at reset. 


8-31 


Reserved 


Read as 0. 
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8.2 External Interface Timing 

This section discusses functional timing of operations on the primary bus and 
the expansion bus, the TMS320C30's two independent parallel buses. De- 
tailed timing specifications for all TI\/1S320C30 signals are contained in Ap- 
pendix A, the TMS320C30 Data Sheet. 

The parallel buses implement three mutually exclusive address spaces distin- 
guished through the use of three separate control signals: STRB,MSTRB, and 
lOSTRB. The STRB signal controls accesses on the primary bus, and the MSTRB 
and lOSTRB control accesses on the expansion bus. Since the two buses are 
independent, two accesses may be made in parallel. 

With the exception of bank switching and the external HOLD function (dis- 
cussed later in this section), timing of primary bus cycles and MSTRB expan- 
sion bus cycles are identical, and will be discussed collectively. The acronym 
(M)STRB will be used in references which pertain equally to STRB and 
MSTRB. Similarly (X)R/W, (X)A, (X)D, and (X)RDY ar e used t o symbolize the 
equivalent primary and expansion bus signals. The lOSTRB expansion bus 
cycles are timed differently and will be discussed independently. 



8.2.1 Primary Bus Cycles 

All bus cycles comprise integral numbers of HI clock cycles. One HI cycle is 
defined to be from one falling edge of HI to the next falling edge of HI . For 
full speed (zero wait state) accesses, reads take one HI cycle, while writes 
take two cycles, unless the write follows a read, in which case the write takes 
three cycles. Recall that internally (from the perspective of the CPU and 
DMA) writes require only one cycle if no accesses to that interface are in 
progress. The following discussions pertain to zero wait state accesses unless 
otherwise specified. 



The (M)STRB signal is low for the active portion of both reads and writes, 
which las ts one HI cycle. Additionally before and after the active portion 
({M)STRB low) of writes only, there is a transition cycle of HI. During this 
transition cycle, the following occur: 



1) (M)STRB is high. 

2) If required, (X)R/W changes state on HI rising. 

3) If required, addresses changes on HI rising if the previous HI cycle was 
the active portion of a write. If the previous HI cycle was a read, address 
changes on HI falling. 



Figure 8-4 illustrates a read-read-write sequence for (M)STRB active and no 
wait states. The data is read as late in the cycle as possible to allow for the 
maximum access time from address valid. Note that although external writes 
take two cycles, internally (from the perspective of the CPU and DMA), they 
require only one cycle if no accesses to that interface are in progress. In the 
typical timing for all external interfaces, the (X)R/W strobe does not change 
until (M)STRB or lOSTRB goes inactive. 
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(X)RDY 




Figure 8-4. Read- Read-Write for (M)STRB = 
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Figure 8-5 llustrates a write-write-read sequence for (M)STRB active and no 
wait states. The address and data written are held valid approximately one- 
half cycle after {M)STRB changes. 




Figure 8-5. Write-Write-Read for (M)STRB = 
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Figure 8-6 illustrates a read c ycle with one wajt state. Since (X)RDY = 
1, the read cycle is extended. ( M)STRB, (X)R/W, and (X)A are also ex- 
tended one cycle. The next time (X)RDy is sampled, it is 0. 





(X)RD? 



7 



VA 



z: 



AlZ 



|«- extra cycle -^ 



Figure 8-6. Use of Wait States for Read for (M)STRB = 
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Figure 8-7 illustrates a write cycle with o ne wait s tate. Smce initially 
(X)RDY = 1, the write cycle is extend ed. (M) STRB, (X)R/W, and (X)A 
are extended one cycle. The next time (X)RDY is sampled, it is 0. 



H3 



H1 




(M)STRB 



(X)R/W 



(X)A 



\ 



/ 



(X)D 



(X)RDY 



( 



write data 



7T\ZX 



\ 



/ 



X 



>-^ 



write data 



z 



X 



^•— extra cycle -^ 



X- 



z 




Figure 8-7. Use of Walt States for Write for (M)STRB = 
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8.2.2 Expansion Bus I/O Cycles 



In contrast to primary bus and MSTRB cycles, lOSTRB reads and writes are both 
two cycles in duration (with no wait states) and exhibit the same timing. 
During these cycles, address always changes on the falling edge of HI, and 
lOSTRB is low from the ri sing edg e of the first HI cycle to the rising edge of 
the second HI cycle. The lOSTRB signal always goes inactive (high) between 
cycles, and XR/W is high for reads and low for writes. 



Figure 8-8 illustrates r ead and write cycles when lOSTRB is active and there 
are no wait states. For lOSTRB accesses, reads and writes require a minimum 
of two cycles. Some off-chip peripherals may change their status bits when 
read or written. Therefore, it is important that valid addresses be maintained 
when c ommunic ating w ith these peripherals. For reads and writes when 
lOSTRB is active, lOSTRB is completely framed by the address. 





Figure 8-8. Read and Write for lOSTRB = 
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Figure 8-9 illustrates a read with one wait state when lOSTRB is active, and 
Figure 8-10 illustrates a write w ith one wait state when lOSTRB is active. For 
each wait state added, lOSTRB, XR/W, and XA are extended one clock cycle. 
Writes hold the data on the bus one additional cycle. The sampling of XRDY 
is repeated each cycle. 




(X)RDY 



17 



VA:/^ 



{<— extra cycle — »j 



Figure 8-9. Read with One Wait-State for iOSTRB = 
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(X)RDY 




7T\ZX1Z 



l^— extra cycle— »| 



Figure 8-10. Write with One Wait-State for lOSTRB = 
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Figure 8-11 through Figure 8-13 illustrate the various transitions between 
memory reads and writes, and I/O writes over the expansion bis. 



H3 



H1 






MSTRB 



^ 



lOSTRB 



(X)/W 



r^i_vr-v_r 



/ 



(X)A mem add 



(X)D 
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X 
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Figure 8-11. Memory Read and I/O Write for Expansion Bus 
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Figure 8-12. I/O Write and Memory Read for Expansion Bus 
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MSTRB 



lOSTRB 



(X)R/W 



(X)A 



(X)D 



(X)RDY 



/ 



mem add 



( 



mem write 



> 



/ 



\ 
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Figure 8-13. Memory Write and I/O Read for Expansion Bus 
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Figure 8-14 and Fi gure 8-1 5 illustrate the signal states when a bus is inactive 
(after a lOSTRB or (M)STRB access respectively). The strobes (STRB, MSTRB, 
l OSTRB ) a nd (X )R/W go to 1 . The address is undefined, and the ready signal 
(XRDY or RDY) is ignored. 




Figure 8-14. Inactive Bus States for lOSTRB 
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Figure 8-15. Inactive Bus States for STRB and MSTRB 
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H3 



Figure 8-16 illustrates the timing for HOLD and HOLDA. HOLD is an external 
asynchronous input. There is a mi nimum of one cycl e delay from when the 
processor recognizes HOLD=0 until HOLDA=0. When HOLDA=0, the address, 
data buses, and associated strobes are placed in a high-impedance state. All 
accesses occurring over an interface are completed before a hold is acknowl- 
edged. 
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Figure 8-16. HOLD and HOLDA Timing 
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8.3 Programmable Wait States 

Both the parallel and expansion interfaces allow the control of wait-state 
generation through the manipulation of their associated memory- mapped 
control registers. The SWW field is used to select the mode of wait-state 
generation, and the WTCNT field is used to load an internal timer used in the 
generation of wait states. The following four modes of wait-state generation 
can be used: 

• External RDY 

• WTCNT-generated RDYwtcnt 

• Logical-AND of RDY and RDYwtcnt 

• Logical-OR of RDY and RDYwtcnt 

These four modes are u sed in the ge nerati on of the internal ready signal that 
controls accesses, RDY jnt- As long as RDYjnt = 1, the current external access 
is delayed. When RDYjnt ~ 0' the current access completes. Since the use of 
programmable wait states for both external interfaces is identical, only the 
primary bus interface is described in the following paragraphs. 

RDYwtcnt 'S an internally generated ready signal. When an external access is 
begun, the value in WTCNT is loaded into a counter. WTCNT may be any 
value from through 7. The counter is decremented every HI /H3 clock cycle 
until it becomes 0. Once the counter is set to , it remains set to until the 
next acce ss. While the counter is nonzero, RDYwtcnt ~ 1 ■ While the counter 
is 0, RDYwtcnt = 0. 

When SWW = 0, RDYjnt 's only dependent upon RDY. RDYwtcnt is ignored. 
The truth table for this mode is shown in Table 8-3. 
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Table 8-3. Wait-State Generation When SWW = 00 



RDY 


RDYwtcnt 


RDYin, 




1 
1 



1 


1 





1 
1 



When SWW = 1, RDYjn^ is only dependent upon RDYwtcnt- RDY is ignored. 
Table 8-4 shows the truth table for this mode. 

Table 8-4. Wait-State Generation When SWW = 01 



RDY 


RDYwtcnt 


RDYi,t 




1 
1 



1 


1 



1 

1 



When SWW = 1 0, RDYjn t is the lo gical-OR (electrical-AND, since these sig- 
nals are low true) of RDY and RDYwtcnt (see Table 8-5). 

Table 8-5. Wait-State Generation When SWW = 10 




RDY 


RDYvvtcnt 


RDYin, 




1 
1 



1 

1 





1 



When SWW = 11, RDYjnt 's the lo gical-AND (electrical-OR, since these sig- 
nals are low true) of RDY and RDYwtcnt The truth table for this mode is shown 
in Table 8-6. 

Table 8-8. Wait-State Generation When SWW = 11 



RDY 


RDYwtcnt 


RDYin, 




1 
1 



1 

1 




1 
1 
1 
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8.4 Programmable Bank Switching 

Programmable bank switching provides the capability of switching between 
external memory banks without the need for externally inserting wait states 
due to memories requiring several cycles to turn off. Bank switching is im- 
plemented on the primary bus and not on the expansion bus. 

The size of a bank is determined by the number of bits specified to be exam- 
ined. For example (see Figure 8-17), if BNKCMP =16, the 16 MSBs of the 
address are used to define a bank. Since addresses are 24 bits, the bank size 
is specified by the 8 LSBs, yielding a bank size of 256 words in this case. If 
BNKCMP > 1 6, only the 1 6 MSBs are compared. Banksizes from 2^ = 256 
to 2^4 = 16M are allowed. Table 8-7 summarizes the relationship between 
BNKCMP, the address bits used to define a bank, and the resulting bank size. 



24-bit Address ■ 



23 8 


7 







k- 



Number of bits to compare- 



Defines 



bank size 
Figure 8-17. BNKCMP Example 




Table 8-7. BNKCMP and Bank Size 



BNKCMP 


MSBs DEFINiNG A BANK 


BANK SIZE (32-BIT WORDS) 


00000 


None 


224 = 16M 


00001 


23 


223 = 8M 


00010 


23-22 


222 = 4M 


00011 


23-21 


221 = 2M 


00100 


23-20 


220 = 1 M 


00101 


23-19 


219 = 512K 


00110 


23-18 


218 = 256K 


00111 


23-17 


217 = 128K 


01000 


23-16 


216 = 64K 


01001 


23-15 


2l5 = 32K 


01010 


23-14 


2l4 = 16K 


01011 


23-13 


213 =8K 


01100 


23-12 


212 =4K 


01101 


23-11 


211 = 2K 


01110 


23-10 


210 =iK 


01111 


23-9 


29 =512 


10000 


23-8 


28 = 256 


10001 


Reserved 


Undefined 


through 






11111 
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Internai to the TMS320C30 is a register that contains the MSBs (as defined 
by the BNKCMP field) of the last address used for a read or write over the 
primary interface. At reset, the register bits are set to zero. If the MSBs of the 
address being used for the current primary interface read do not match those 
contained in this internal register, a read cycle is not asserted for one H1/H3 
clock cycle. During this extra clock cycle, the address bus switches over to the 
new address, but STRB is inactive (high). The contents of the internal register 
are replaced with the MSBs being used for the current read of the current ad- 
dress. If the MSBs of the address being used for the current read match the 
bits in the register, a normal read cycle takes place. 

If repeated reads are performed from the same memory bank, no extra cycles 
are inserted. When reading from a different memory bank, memory conflicts 
are avoided by the insertion of an extra cycle. This feature can be disabled 
by setting BIMKCMP to 0. The insertion of the extra cycle occurs only when a 
read is performed. The changing of the MSBs in the internal register occurs 
for all reads and writes over the primary interface. 

Figure 8-18 illustrates the addition of an inactive cycle when switches be- 
tween banks of memory occur. 
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[«— extra cycle— »{ 




Figure 8-18. Bank Switching Example 
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Section 9 

Peripherals 



The TMS320C30 provides two timers, two serial ports, and an on-chip Direct 
Memory Access (DMA) controller. These peripheral modules are manipulated 
through memory-mapped registers located on the dedicated peripheral bus. 

The DMA controller is used to perform input/output operations without in- 
terfering with the operation of the CPU. Therefore, it is possible to interface 
the TMS320C30 to slow external memories and peripherals (A/D's, serial 
ports, etc.) without reducing the computational throughput of the CPU. The 
result is improved system performance and decreased system cost. 

Major topics discussed in this section on peripherals are listed below. 

© Timers (Section 9.1 on page 9-2) 

- Registers 

- Pulse generation 

- Operation modes 

© Serial Ports (Section 9.2 on page 9-9) 

- Registers 

- Operation configurations 
Timing 

© DMA Controller (Section 9.3 on page 9-26) 

- Registers 

- VA memory transfer operation 

- Synchronization of DMA channels 
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9.1 Timers 



The TMS320C30 timer modules are general-purpose 32-blt timer/event 
counters, with two signalling modes and internal or external clocking (see 
Figure 9-1 ). The timer modules can be used to signal to the TMS320C30 or 
the external world at specified intervals, or to count external events. With an 
internal clock, the timer can be used to signal an external A/D converter to 
start a conversion, or it can interrupt the TMS320C30 DMA controller to begin 
a data transfer. With an external input, the timer can count external events and 
interrupt the CPU after a specified number of events. Available to each timer 
is an I/O pin that can be used either as an input clock to the timer, an output 
clock signal, or a general-purpose I/O pin. 



PERIOD REGISTER 
(31-0) 



COUNTER (32-BITS) 



COUNTER REGISTER 
(31-0) 



32 



COMPARATOR 



PERIOD = COUNTER 



■ INTERNAL CLOCK/2 

EXTERNAL 

CLOCK 

INV 







32 




PULSE GENERATOR 



INV 



■*- TSTAT 



TIMER OUT 

Figure 9-1. Timer Block Diagram 

Three memory-mapped registers are used by each timer. They are: 

• Global control register 

• Period register 

• Counter register 

The global control register determines the operating mode of the timer, moni- 
tors the timer status, and controls the function of the I/O pin of the timer. The 
period register specifies the timer's signalling frequency. The counter register 
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contains the current value of the incrementing counter. The timer can be in- 
cremented on the rising edge or the falling edge of the input clock. The 
counter is zeroed whenever its value equals that in the period register. The 
pulse generator generates two types of external clock signals: pulse or clock. 
The memory map for the timer modules is shown in Figure 9-2. 



Register 



TIMER GLOBAL CONTROL REGISTER 



RESERVED 



RESERVED 



RESERVED 



TIMER COUNTER REGISTER 



RESERVED 



RESERVED 



RESERVED 



TIMER PERIOD REGISTER 



RESERVED 



RESERVED 



RESERVED 



RESERVED 



RESERVED 



RESERVED 



RESERVED 



Peripheral Address 
Timer Timer 1 



808020h 
808021 h 
808022h 
808023h 
808024h 
808025h 
808026h 
808027h 
808028h 
808029h 
80802Ah 
80802Bh 
80802Ch 
80802Dh 
80802Eh 
80802Fh 



808030h 
808031 h 
808032h 
808033h 
808034h 
808035h 
808036h 
808037h 
808038h 
808039h 
80803Ah 
80803Bh 
80803Ch 
80803Dh 
80803Eh 
80803Fh 



Figure 9-2. Memory- Mapped Timer Locations 




9.1.1 Timer Global Control Register 

The timer global control register is a 32-bit register that contains the global 
and port control bits for the timer module. Table 9-1 defines the register bits, 
names, and functions. Bits 3-0 are the port control bits. Bits 11-6 are the 
timer global control bits. Figure 9-3 shows the 32-bit register. Note that at 
reset, all bits are set to except for DATIN (set to the value read on TCLK). 
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Table 9-1. Timer Global Control Register Bits Summary 




BITS 


NAME 


FUNCTION 





FUNC 


FUNC controls the function of TCLK. If FUNC - 0, TCLK is con- 
figured as a general-purpose digital I/O port. If FUNC = 1, TCLK 
is configured as a timer pin (see Figure 9-6) for a description of the 
relationship between FUNC and CLKSRC. 


1 


T/0 


If FUNC = and CLKSRC = 0, TCLK is configured as a general- 
purpose I/O pin. In this case, if T/0 = 0, TCLK is configured as a 
general-purpose input pin. If T/0 = 1, TCLK is configured as a 
general-purpose output pin. 


2 


DATOUT 


DATOUT drives TCLK when in I/O port mode. DATOUT can also 
be used as an input to the timer. 


3 


DATIN 


Data input on TCLK or DATOUT. A write has no effect. 


4-5 


Reserved 


Read as 0. 


6 


GO 


The GO bit resets and starts the timer counter. When GO = 1 and 
the timer is not held, the counter is zeroed and begins incrementing 
on the next rising edge of the timer input clock. The GO bit is 
cleared on the same rising edge. GO = has no effect on the timer. 


7 


HLD 


Counter hold signal. When this bit is zero, the counter is disabled 
and held in its current state. If the timer is driving TCLK, the state 
of TCLK is also held. The internal divide-by-two counter is also 
held so that the counter can continue where it left off when HLD 
is set to 1 . The timer registers can be read and modified while the 


timer is being held. RESET has priority over HLD. Table 9-2 shows 
the effect of writing to GO and HLD. 


8 


C/P 


Clock/Pulse mode control. When C/P = 1 , clock mode is chosen 
and the signalling of the status flag and external output will have a 
50 percent duty cycle. When C/P = 0, the status flag and external 
output will be active for one HI cycle during each timer period (see 
Figure 9-4). 


9 


CLKSRC 


Specifies the source of the timer clock. When CLKSRC = 1, an in- 
ternal clock with frequency equal to one-half the HI frequency is 
used to increment the counter. The INV bit has no effect on the 
internal clock source. When CLKSRC = 0, an external signal from 
the TCLK pin can be used to increment the counter. The external 
clock is synchronized internally, thus allowing external asynchro- 
nous clock sources not exceeding the specified maximum allowable 
external clock frequency. This will be less than f(H1)/2. (See Fig- 
ure 9-6 for a description of the relationship between FUNC and 
CLKSRC). 


10 


INV 


Inverter control bit. If an external clock source is used and INV = 
1 , the external clock is inverted as it goes into the counter. If the 
output of the pulse generator is routed to TCLK and INV = 1, the 
output is inverted before it goes to TCLK (see Figure 9-1). If INV 
= 0, no inversion is performed on the input or output of the timer. 
The INV bit has no effect, regardless of its value, when TCLK is used 
in I/O port mode. 


11 


TSTAT 


This bit indicates the status of the timer. It tracks the output of the 
uninverted TCLK pin. This flag sets a CPU interrupt on a 
transition from to 1 . A write has no effect. 


12-31 


Reserved 


Read as 0. 
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27 


26 


25 
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23 
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21 


20 


19 


18 


17 16 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


15 14 13 12 


11 


10 


9 


8 


7 


6 


5 


4 


3 


2 


1 


XX 


XX 


XX 


XX 


TSTAT 


INV 


CLKSRC 


C/P 


HLD 


GO 


XX 


XX 


DATIN 


DATOUT 


T/0 FUNC 



R/W R/W R/W R/W R/W R/W 



R/W R/W R/W 



NOTE: XX = Reserved bit, read as 0. 
R = read, W = write. 



Figure 9-3. Timer Global Control Register 



Table 9-2 shows the result of a write using specified values of the GO and HLD 
bits in the global control register. 

Table 9-2. Result of a Write of Specified Values of GO and HLD 



GO 


HLD 


RESULT 








All timer operations are held. No reset is performed. 





1 


Timer proceeds from state before write. 


1 





All timer operations are held, including zeroing of the counter. The GO 
bit is not cleared until the timer is taken out of hold. 


1 


1 


Timer reset and started. 



9.1.2 Timer Period and Counter Registers 

The 32-bit timer period register is used to specify the frequency of the timer 
signalling. The timer counter register is a 32-bit register, which is reset to zero 
whenever it increments to the value of the period register. Both registers are 
set to at reset. 

Certain boundary conditions affect timer operation, such as a zero in the pe- 
riod register and overflowing the counter. These conditions are listed as fol- 
lows: 

o When the period and counter _registers are zero, the operation of_the 
timer is dependent upon the C/P mode selected. In pujse mode (C/P = 
0), TSTAT is set and remains set. In clock mode (C/P = 1 ), the width 
of the cycle is 2/f(H1 ) and the external clocks are ignored. 

• When the counter register is not and the period register - 0, the 
counter will count, roll over to 0, and then behave as described above. 

• When the counter register is set to a value greater than the period regis- 
ter, the counter may overflow when being incremented. Once the 
counter reaches its maximum 32-bit value (OFFFFFFFFh), it simply 
clocks over to and continues. 

Writes from the peripheral bus override register updates from the counter or 
new status updates to the control register. 
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9.1.3 Timer Pulse Generation 



The timer pulse generator (see Figure 9-1 ) can generate several different ex- 
ternal signals. These signals may be inverted with the INV bit. The two basic 
modes are pulse mode and clock mode, as shown in Figure 9-4. In both 
modes, an internal clock source has a frequency of f(H1)/2, and an external 
clock source has a maximum frequency less than f(H1)/2. Refer to timer 
timing in Appendix A. In pulse mode (C/P = 0), the width of the pulse is 
1/f(H1). 



2/f(H1) 




1/f{H1) 



Tl 



■*4-i 



r 



i-1/f(CLKSRC) 



■period register/f(CLKSRC)- 



(a) TSTAT AND TIMER OUTPUT {INV = 0) WHEN C/P = {PULSE MODE) 




J 



i I 



■1/f(CLKSRC) 
-2/f(H1) 



r 



M- period register/f(CLKSRC) ■ 

U 2 X period register/f {CLKSRO ■ 



{b) TSTAT AND TIMER OUPUT {INV = 0) WHEN C/P = 1 {CLOCK MODE) 



Figure 9-4. Timer Timing 
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The rate of timer signaling is determined by the frequency of the timer input 
clock and the period register. The following equations are valid with either 
an internal or an external timer clock: 



f(pulse mode) = f(timer clock)/period register 
f(clock mode) = f(timer clock)/(2 x period register) 



9.1.4 Timer Operation Modes 



The timer can receive its input and send its output in several different modes, 
depending upon the setting of CLKSRC, FUNC, and I/O. The four timer 
modes of operation are defined as follows: 

O If CLKSRC = 1 and FUNC = 0, the timer input comes from the internal 
clock. The internal clock is not affected by the INV bit. In this mode, 
TCLK is connected to the I/O port control and can be used as a gener- 
al-purpose I/O pin (see Figure 9-5). If T/0 = 0, TCLK is configured as 
a general-purpose input pin whose state can be read in DATIN. DAT- 
OUT has no effect on TCLK or DATIN. If 1/0 = 1, TCLK is configured 
as a general-purpose output pin. DATOUT is placed on TCLK and can 
be read in DATIN. 



INTERNAL 
DATOUT(NC) O 



1 
DATIN 

i/0 = 
(a) 

INTERNAL 
DATOUT «" 0- 



EXTERNAL 
TCLK 



I 

I EXTERNAL 

-. TCLK 



DATIN 

i/0 = 1 
(b) 

Figure 9-5. Timer I/O Port Configurations 



If CLKSRC = 1 and FUNC = 1, the timer input comes from the internal 
clock and the timer output goes to TCLK. This value may be inverted 
using INV, and the value output on TCLK can be read in DATIN. 

If CLKSRC = and FUNC = 0, the timer is driven according to the sta- 
tus of the I/O bit. If T/0 = 0, the timer input comes from TCLK. This value 
can be inverted using INV, and the value of TCLK can be read in DATIN. 



9-7 



Peripherals - Timers 



If T/0 = 1, TCLK is an output pin. Then TCLK and the timer are both 
driven by DATOUT. All to 1 transitions of DATOUT increment the 
counter. INV has no effect on DATOUT. The value of DATOUT can be 
read in DATIN. 

If CLKSRC = and FUNC = 1, TCLK drives the timer. If INV = 0, all 
to 1 transitions of TCLK increment the counter. If INV = 1, alM to 
transitions of TCLK increment the counter. The value of TCLK can be 
read in DATIN. 



Figure 9-6 shows the four timer modes of operation. 



TIMER 



TIMER IN 
TIMER OUT 



internal 

internal 
"clock 



TSTAT 



I/O PORT 
CONTROL 



EXTERNAL 



-»-TCLK 



CLKSRC = 1 (INTERNAL) 
FUNC = (I/O PIN) 
(a) 



TIMER 



TIMER IN 
TIMER OUT 



INTERNAL 

INTERNAL 
CLOCK 



EXTERNAL 



TCLK 



TSTAT 



DATIN 



CLKSRC = 1 (INTERNAL) 
FUNC = 1 (TIMER PIN) 

(b) 




TIMER 


INTERNAL 


. 1 EX 

1 


TIMER IN 
TIMER OUT 










, 


1 ^ 

1 
1 


V ( 




TSTAT 


I/O PORT 
CONTROL 





EXTERNAL 
TCLK 



TIMER 



INTERNAL I EXTERNAL 



TIMER IN 
TIMER OUT 



TSTAT 



DATIN 



CLKSRC = (EXTERNAL) 
FUNC = (I/O PIN) 

(0 



CLKSRC = (EXTERNAL) 
FUNC = 1 (TIMER PIN) 

(d) 



■t-*-TCLK 

I 
I 



Figure 9-6. Timer Modes as Defined by CLKSRC and FUNC 
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9.2 Serial Ports 



The two TMS320C30 serial ports are totally independent. Both serial ports 
are identical with a complementary set of control registers in each one. Each 
serial port can be configured to transfer 8, 16, 24, or 32 bits of data per word. 
The clock for each serial port can originate either internally or externally. An 
internally generated clock is a divide-down of the clockout frequency (HI). 
A continuous transfer mode is available which allows the serial port to transmit 
and receive any number of words without new synchronization pulses. 

Eight memory-mapped registers are provided for each serial port. They are: 

Port global control register 

O Two port control registers for the six I/O pins 

• Three port receive/transmit timer registers 

• Data transmit register 
9 Data receive register 

The global control register controls the global functions of the serial port and 
determines the serial port operating mode . Two port control registers control 
the functions of the six serial port pins. The transmit buffer contains the next 
complete word to be transmitted. The receive buffer contains the last com- 
plete word received. Three additional registers are associated with the 
transmit/receive sections of the serial port timer. A serial port block diagram 
is shown in Figure 9-7, and the memory map of a serial port is shown in Figure 
9-8. 
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■RECEIVE SECTION- 



r 



RECEIVE 
TIMER (16) 



CLKR 
TSTAT 



■TRANSMIT SECTION 
CLKX 



CLKR 



CLKX 



TSTAT 



RINT 



n 



BIT COUNTER 
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RECEIVE 
CLOCK 
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Figure 9-7. Serial Port Block Diagram 



9-10 



Peripherals - Serial Ports 



Register 



PORT GLOBAL CONTROL REGISTER 



RESERVED 



FSX/DX/CLKX PORT CONTROL REGISTER 



FSR/DR/CLKR PORT CONTROL REGISTER 



R/X TIMER CONTROL REGISTER 



R/X TIMER COUNTER REGISTER 



R/X TIMER PERIOD REGISTER 



RESERVED 



DATA TRANSMIT REGISTER 



RESERVED 



RESERVED 



RESERVED 



DATA RECEIVE REGISTER 



RESERVED 



RESERVED 



RESERVED 
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Serial 
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Port 1 


808040h 


808050h 


808041 h 


808051 h 


808042h 


808052h 


808043h 


808053h 


808044h 


808054h 


808045h 


808055h 


808046h 


808056h 


808047h 


808057h 


808048h 


808058h 


808049h 


808059h 
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80804Ch 
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Figure 9-8. Memory-Mapped Locations for the Serial Port 



9.2.1 Serial Port Global Control Register 

The serial port global control register is a 32-bit register that contains the 
global control bits for the serial port. Table 9-3 defines the register bits, bit 
names, and bit functions. The register is shown in Figure 9-9. 
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Table 9-3. Serial Port Global Control Register Bits Summary 



BIT 


NAME 


FUNCTION 





RRDY 


If RRDY = 1, the receive buffer has new data and is ready to be read. A three 
H1/H3 cycle delay occurs from the reading of DRR to RRDY = 1. The rising edge 
of this signal sets RINT. If RRDY = 0, the receive buffer does not have new data 
since the last read. RRDY is set to at reset and after the receive buffer is read. 


1 


XRDY 


If XRDY = 1 , the transmit buffer has written the last bit of data to the shifter and 
is ready for a new word. A three H1/H3 cycle delay occurs from the loading of 
the transmit shifter to XRDY being set to 1. The rising edge of this signal sets 
XINT. If XRDY = 0, the transmit buffer has not written the last bit of data to the 
transmit shifter and is not ready for a new word. XRDY is set to 1 at reset. 


2 


FSXOUT 


This bit configures the FSX pin as an input (FSXOUT = 0) or an output (FSXOUT 
= 1). 


3 


XSREMPTY 


If XSREMPTY = 0, the transmit shift register is empty. If XSREMPTY = 1, the 
transmit shift register is not empty. This bit is set to at reset or by an XRESET. 


4 


RSRFULL 


If RSRFULL = 1, an overrun of the receiver has occurred. In continuous mode, 
RSRFULL is set to 1 when both RSR and DRR are full. In noncontinuous mode, 
RSRFULL is set to 1 when RSR and DRR are full and a new FSR is received. A 
read causes this bit to be set to 0. This bit can only be set to by a system reset, 
a serial port receive reset (RRESET = 1 ), or a read. When the receiver tries to set 
RSRFULL to a 1 at the same time that the global register is read, the receiver will 
dominate and RSRFULL is set to 1 . If RSRFULL = 0, no overrun of the receiver 
has occurred. 


5 


HS 


If HS = 1, the handshake mode is enabled. If HS = 0, the handshake mode is 
disabled. 


6 


XCLKSRCE 


If XCLKSRCE = 1, the internal transmit clock is used. If XCLKSRCE = 0, the 
external transmit clock is used. 


7 


RCLKSRCE 


If RCLKSRCE = 1, the internal receive clock is used. If RCLKSRCE = 0, the ex- 
ternal receive clock is used. 


8 


XVAREN 


This bit specifies fixed (XVAREN = 0) or variable (XVAREN = 1) data rate sig- 
nalling when transmitting. With a fixed data rate, FSX is active for at least one 
XCLK cycle, and then goes inactive before transmission begins. With variable 
data rate, FSX is active while all bits are being transmitted. When using an ex- 
ternal FSX and variable data rate signaling, the DX pin is driven by the transmitter 
when FSX is held active or when a word is being shifted out. 


9 


RVAREN 


This bit specifies fixed (RVAREN = 0) or variable (RVAREN = 1) data rate sig- 
nalling when receiving. With a fixed data rate, FSR is active for at least one RCLK 
cycle, and then goes inactive before the reception begins. With variable data rate, 
FSR is active while all bits are being received. 


10 


XFSM 


Transmit frame sync mode. Configures the port for continuous mode operation 
(XFSM = 1 ) or standard mode (XFSM = 0). In continuous mode, only the first 
word of a block generates a sync pulse, and the rest are simply transmitted con- 
tinuously to the end of the block. In standard mode, each word has an associated 
sync pulse. 


11 


RFSM 


Receive frame sync mode. Configures the port for continuous mode (RFSM = 
1) or standard mode (RFSM = 0) operation. In continuous mode, only the first 
word of a block generates a sync pulse, and the rest are simply received contin- 
uously without expectation of another sync pulse. In standard mode, each word 
received has an associated sync pulse. 


12 


CLKXP 


CLKX polarity. If CLKXP = 0, CLKX is active high. If CLKXP = 1 , CLKX is active 
low. 
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Table 9-3. Serial Port Global Control Register Bits Summary (Concluded) 



BIT 


NAME 


FUNCTION 


13 


CLKRP 


CLKR polarity. If CLKRP = 0, CLKR is active high. If CLKRP = 1, CLKR is active 
low/. 


14 


DXP 


DX polarity. If DXP = 0, DX is active high. If DXP = 1, DX is active low. 


15 


DRP 


DR polarity. If DRP = 0, DR is active high. If DRP = 1 , DR is active low. 


16 


FSXP 


FSX polarity. If FSXP = 0, FSX is active high. If FSXP = 1 , FSX is active low. 


17 


FSRP 


FSR polarity. If FSRP = 0, FSR is active high, if FSRP = 1, FSR is active low. 


18-19 


XLEN 


This bit defines the word length of serial data transmitted. All data is assumed 
to be right-justified in the transmit buffer when fewer than 32 bits are specified. 

-- 8 bits 1 -- 24 bits 

1 — 16 bits 1 1 —32 bits 


20-21 


RLEN 


This bit defines the word length of serial data received. All data is right-justified 
in the receive buffer. 

--- 8 bits 1 --- 24 bits 
1 — 16 bits 1 1 —32 bits 


22 


XTINT 


Transmit timer interrupt enable. If XTINT = 0, the transmit timer interrupt is dis- 
abled. If XTINT = 1, the transmit timer interrupt is enabled. 


23 


XI NT 


Transmit interrupt enable. If XINT = 0, the transmit interrupt is disabled. If XINT 
= 1 , the transmit interrupt is enabled. Note that the CPU transmit interrupt flag 
XINT is the logical OR of the enabled transmit timer interrupt and the enabled 
transmit interrupt. 


24 


RTINT 


Receive timer interrupt enable. If RTINT = 0, the receive timer interrupt is disa- 
bled. If RTINT = 1, the receive timer interrupt is enabled. 


25 


RINT 


Receive interrupt enable. If RINT = 0, the receive interrupt is disabled. If RINT 
= 1 , the receive interrupt is enabled. Note that the CPU receive interrupt flag 
RINT is the OR of the enabled receive timer interrupt and the enabled receive 
interrupt 


26 


XRESET 


Transmit reset. If XRESET = 0, the transmit side of the serial port is reset. To take 
the transmit side of the serial port out of reset, XRESET should be set to 1. 
However, XRESET should not be set to 1 until at least three cycles after XRESET 
goes inactive. This applies only to system reset. Setting XRESET to does not 
change the contents of any of the serial port control registers. It places the 
transmitter in a state corresponding to the beginning of a frame of data. Resetting 
the transmitter generates a transmit interrupt. This bit should be set at the same 
time the mode of the transmitter is set. XFSM can be toggled without resetting 
the global control register. 


27 


RRESET 


Receive reset. If RRESET = 0, the receive side of the serial port is reset. To take 
the transmit side of the serial port out of reset, XRESET should be set to 1 . Set- 
ting RRESET to does not change the contents of any of the serial port control 
registers. It places the receiver in a state corresponding to the beginning of a 
frame of data. This bit should be set at the same time the mode of the receiver 
is set. RFSM can be toggled without resetting the global control register. 


28-31 


Reserved 


Read as 0. 
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Figure 9-9. Serial Port Global Control Register 



9.2.2 FSX/DX/CLKX Port Control Register 

This 32-bit port control register controls the function of the serial port FSX, 
DX, and CLKX pins. At reset, all bits are set to 0. Table 9-4 defines the register 
bits, bit names, and functions. Figure 9-10 shows this port control register. 

Table 9-4. FSX/DX/CLKX Port Control Register Bits Summary 




BIT 


NAME 


FUNCTION 





CLKXFUNC 


CLKXFUNC controls the function of CLKX. If CLKXFUNC 
= 0, CLKX is configured as a general-purpose digital I/O 
port. If CLKXFUNC = 1, CLKX is a serial port pin. 


1 


CLKXT/0 


If CLKXT/0 = 0, CLKR is configured as a general-purpose 
input pin. If CLKXT/0 = 1, CLKX is configured as a gener- 
al-purpose output pin. 


2 


CLKXDATOUT 


Data output on CLKX. 


3 


CLKXDATIN 


Data input on CLKX. A write has no effect. 


4 


DXFUNC 


DXFUNC controls the function of DX. If DXFUNC = 0, DX 
is configured as a general-purpose digital I/O port. If 
DXFUNC = 1, DX is a serial port pin. 


5 


DXT/0 


If DXT/0 = 0, DX is configured as a general-purpose input 
pin. If DXT/0 = 1, DX is configured as a general-purpose 
output pin. 


6 


DXDATOUT 


Data output on DX. 


7 


DXDATIN 


Data input on DX. A write has no effect. 


8 


FSXFUNC 


FSXFUNC controls the function of FSX. If FSXFUNC = 0, 
FSX is configured as a general-purpose digital I/O port. If 
FSXFUNC = 1, FSX is a serial port pin. 


9 


FSXT/0 


If FSXT/O = 0, FSX is configured as a general-purpose input 
pin. If FSXT/O = 1, FSX is configured as a general-purpose 
output pin. 


10 


FSXDATOUT 


Data output on FSX. 


11 


FSXDATIN 


Data input on FSX. A write has no effect. 


12-31 


Reserved 


Read as 0. 
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R = read, W = write. 



Figure 9-10. FSX/DX/CLKX Port Control Register 



9.2.3 FSR/DR/CLKR Port Control Register 

This 32-bit port control register is controlled by the function of the serial port 
FSR, DR, and CLKR pins. At reset, all bits are set to 0. Table 9-5 defines the 
register bits, the bit names, and functions. Figure 9-11 illustrates this port 
control register. 

Table 9-5. FSR/DR/CLKR Port Control Register Bits Summary 



BIT 


NAME 


FUNCTION 





CLKRFUNC 


CLKRFUNC controls the function of CLKR. If CLKRFUNC 
= 0, CLKR is configured as a general-purpose digital I/O 
port. If CLKRFUNC = 1, CLKR is a serial port pin. 


1 


CLKRT/0 


If CLKRT/0 = 0, CLKR is configured as a general-purpose 
input pin. If CLKRT/0 = 1, CLKR is configured as a gener- 
al-purpose output pin. 


2 


CLKRDATOUT 


Data output on CLKR. 


3 


CLKR DATIN 


Data input on CLKR. A write has no effect. 


4 


DRFUNC 


DRFUNC controls the function of DR. If DRFUNC = 0, DR 
is configured as a general-purpose digital I/O port. If 
DRFUNC = 1, DR is a serial port pin. 


5 


DRT/0 


If DRT/0 = 0, DR is configured as a general-purpose input 
pin. If DRT/0 = 1 , TCLK is configured as a general-purpose 
output pin. 


6 


DR DATOUT 


Data output on DR. 


7 


DRDATIN 


Data input on DR. A write has no effect. 


8 


FSRFUNC 


FSRFUNC controls the function of FSR. If FSRFUNC = 0, 
FSR is configured as a general-purpose digital I/O port. If 
FSRFUNC = 1, FSR is a serial port pin. 


9 


FSRT/0 


If FSRT/0 = 0, FSR is configured as a general-purpose input 
pin. If FSRT/0 = 1 , FSR is configured as a general-purpose 
output pin. 


10 


FSR DATOUT 


Data output on FSR. 


11 


FSR DATIN 


Data input on FSR. A write has no effect. 


12-31 


Reserved 


Read as 0. 
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Figure 9-11. FSR/DR/CLKR Port Control Register 



9.2.4 Receive/Transmit Timer Control Register 

A 32-bit receive/transmit timer control register contains the control bits for the 
timer module. At reset, all bits are set to 0. Table 9-6 lists the register bits, bit 
names, and functions. Bits 5-0 control the transmitter timer. Bits 11-6 control 
the receiver timer. Figure 9-12 shows the register. 
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Table 9-6. Receive/Transmit Timer Control Register 



BIT 


NAME 


FUNCTION 





XGO 


The XGO bit resets and starts the transmit tinner counter. When 
XGO is set to 1 and the timer is not held, the counter is zeroed 
and begins incrementing on the next rising edge of the timer 
input clock. The XGO bit is cleared on the same rising edge. 
Writing to XGO has no effect on the transmit timer. 


1 


XHLD 


Transmit counter hold signal. When this bit is set to 0, the 
counter is disabled and held in its current state. The internal 
divide-by-two counter is also held so that the counter will 
continue where it left off when XHLD is set to 1. The timer 
registers may be read and modified while the timer is being 
held. RESET has priority over XHLD. 


2 


XC/P 


XCIock/Pulse mode control. When XC/P = 1, the clock mode 
is chosen. The signalling of the status fla£ and external output 
has a 50-percent duty cycle. When XC/P = 0, the status flag 
and external output are active for one CLKOUT cycle during 
each timer period. 


3 


XCLKSRC 


This bit specifies the source of the transmit timer clock. When 
XCLKSRC = 1, an internal clock with frequency equal to one- 
half the CLKOUT frequency is used to increment the counter. 
When XCLKSRC = 0, an external signal from the CLKX pin can 
be used to increment the counter. The external clock source is 
synchronized internally, thus allowing for external asynchro- 
nous clock sources that do not exceed the specified maximum 
allowable external clock frequency, i.e., less than f(H1 )/2.6. 


4 


Reserved 


Read as zero. 


5 


XTSTAT 


This bit indicates the status of the receive timer. It tracks what 
would be the output of the uninverted CLKX pin. This flag sets 
a CPU interrupt on a transition from to 1. A write has no 
effect. 


6 


RGO 


The RGO bit resets and starts the receive timer counter. When 
RGO is set to 1 and the timer is not held, the counter is zeroed 
and begins incrementing on the next rising edge of the timer 
input clock. The RGO bit is cleared on the same rising edge. 
Writing to RGO has no effect on the receive timer. 


7 


RHLD 


Receive counter hold signal. When this bit is set to 0, the 
counter is disabled and held in its current state. The internal 
divide-by-two counter is also held so that the counter will 
continue where it left off when RHLD is set to 1. The timer 
registers may be read and modified while the timer is being 
held. RESET has priority over RHLD. 
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Table 9-6. Receive/Transmit Timer Control Register (Concluded) 



BIT 


NAME 


FUNCTION 


8 


RC/P 


RCIock/Pulse mode control. When RC/P = 1, the clock mode 
is chosen. The signalling of the status flag and external output 
has a 50-percent duty cycle. When RC/P = 0, the status flag 
and external output are active for one CLKOUT cycle during 
each timer period. 


9 


RCLKSRC 


This bit specifies the source of the receive timer clock. When 
RCLKSRC = 1, an internal clock with frequency equal to one- 
half the CLKOUT frequency is used to increment the counter. 
When RCLKSRC = 0, an external signal from the CLKR pin can 
be used to increment the counter. The external clock source is 
synchronized internally, thus allowing for external asynchro- 
nous clock sources that do not exceed the specified maximum 
allowable external clock frequency, i.e., less than f(H1)/2.6. 


10 


Reserved 


Read as zero. 


11 


RTSTAT 


This bit indicates the status of the receive timer. It tracks what 
would be the output of the uninverted CLKR pin. This flag sets 
a CPU interrupt on a transition from to 1 . A write has no 
effect. 


12-31 


Reserved 


Read as 0. 
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Figure 9-12. Receive/Transmit Timer Control Register 



9.2.5 Receive/Transmit Timer Counter Register 

The timer counter register is a 32-bit register (see Figure 9-1 3). Bits 1 5-0 are 
the transmit timer counter, and bits 31 -16 are the receive timer counter. Each 
counter is set to whenever it increments to the value of the counter. It is also 
set to at reset. 



31 




16 


RECEIVE COUNTER 


15 







TRANSMIT COUNTER 



NOTE: All bits are read/write. 

Figure 9-13. Receive/Transmit Timer Counter Register 
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9.2.6 Receive/Transmit Timer Period Register 

The timer period register is a 32-bit register (see Figure 9-14) Bits 15-0 are 
the timer transmit period, and bits 31 -16 are the receive period. Each register 
is used to specify the period of the timer. It is also set to at reset. 



31 




16 


1 RECEIVE PERIOD | 


15 







1 TRANSMIT PERIOD | 



NOTE: All bits are read/write. 
Figure 9-14. Receive/Transmit Timer Period Register 



9.2.7 Data Transmit Register 



When the data transmit register (DXR) is loaded, the transmitter loads the 
word into the transmit shift register (XSR), and the bits are shifted out. The 
delay from a write to DXR until an FSX occurs (or can be accepted) is two 
CLKX cycles. The word is not loaded into the shift register until the shifter is 
empty. When DXR is loaded into XSR, the XRDY bit is set, specifying that the 
buffer is available to receive the next word. Four tap points within the transmit 
shift register are used to transmit the word. These tap points correspond to 
the four data word sizes and are illustrated in Figure 9-15. The shift is a left- 
shift (LSB to MSB) with the data shifted out of the MSB corresponding to the 
appropriate tap point. 



31 



24 23 



shift direction 
16 15 
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32-bit 
word tap 



24-bit 
word tap 



16-bit 
word tap 



8-bit 
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Figure 9-15. Transmit Buffer Shift Operation 




9.2.8 Data Receive Register 



When serial data is input, the receiver shifts the bits into the receive shift reg- 
ister (RSR). When the specified number of bits are shifted in, the data receive 
register (DRR) is loaded from RSR and the RRDY status bit is set. The receiver 
is double-buffered. If the DRR has not been read and the RSR is full, the re- 
ceiver is frozen. New data coming into the DR pin is ignored. The receive 
shifter will not write over the DRR. The DRR must be read to allow new data 
in the RSR to be transferred to the DRR. When a write to DRR occurs at the 
same time that a RSR to DRR transfer takes place, the RSR to DRR transfer 
has priority. 

Data is shifted to the left (LSB to MSB). Figure 9-16 illustrates what happens 
when words less than 32 bits are shifted into the serial port. In this figure, it 
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is assumed that an 8-bit word is being received and that the upper three bytes 
of the receive buffer are originally undefined. In the first portion of the figure, 
byte a has been shifted in. When byte b is shifted in, byte a is shifted to the 
left. When the data receive register is read, both bytes a and b are read. 
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After Byte b 
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Figure 9-16. Receive Buffer Shift Operation 



9.2.9 Serial Port Operation Configurations 

Several configurations are provided for the operation of the serial port clocks 
and timer. The clocks for each serial port can originate either internally or ex- 
ternally. Figure 9-17 shows serial port clocking in the I/O mode (FUNC = 0) 
when CLKX is either an input or an output. Figure 9-1 8 shows clocking in the 
serial port mode (FUNC = 1). Both figures use a transmit section for an ex- 
ample. The same relationship holds for a receive section. 
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Figure 9-17. Serial Port Clocking in I/O Mode 
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Figure 9-18. Serial Port Clocking in Serial Port Mode 
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9.2.10 Serial Port Timing 



The formula for calculating the frequency of the serial port clock with an in- 
ternally generated clock is dependent upon the operation mode of the serial 
port timers, defined as: 

f (pulse mode) = f (timer ciock)/period register 
f (clock mode) = f (timer clock)/(2 x period register) 

An externally generated serial port clock (CLKX or CLKR) has a maximum 
frequency less than f(H1 )/2.6. See serial port timing in Appendix A. 

Transmit data is clocked out on the rising edge of the selected serial port clock. 
Receive data is latched into the receive shift register on the falling edge of the 
serial port clock. All data is transmitted and loaded MSB first and right-justi- 
fied. If less than 32 bits are transferred, the data is right-justified in the 32-bit 
transmit and receive buffers. Therefore, the LSBs of the transmit buffer are the 
bits that are transmitted. 

The transmit ready (XRDY) signal specifies that the data transmit register 
(DXR) is available to be loaded with new data. XRDY goes active as soon 
as the data is loaded into the transmit shift register (XSR). The last word may 
still be shifting out when XRDY goes active. If DXR is loaded before the last 
word has completed transmission, the data bits transmitted will be consec- 
utive, i.e., the LSB of the first word immediately precedes the MSB of the 
second, with all signalling valid as in two separate transmits. XRDY goes in- 
active when DXR is loaded, and remains inactive until the data is loaded into 
the shifter. 

The receive ready (RRDY) signal is active as long as a new word of data is 
loaded into the data receive register and has not been read. As soon as the 
data is read, the RRDY bit is turned off. 

When FSX is specified as an output, the activity of the signal is determined 
solely by the internal state of the serial port. When a fixed data rate is speci- 
fied, FSX goes active when DXR is loaded into XSR to be transmitted out. 
One serial clock cycle later, FSX turns inactive and data transmission begins. 
When a variable data rate is specified, the FSX pin is activated when the data 
transmission begins, and remains active during the entire transmission of the 
word. Again, the data is transmitted one clock cycle after it is loaded into the 
data transmit register. 

An input FSX in the fixed data rate mode should go active for at least one se- 
rial clock cycle and then inactive to initiate the data transfer. The transmitter 
then transmits the number of bits specified by the LEN bits. In the variable 
data rate mode, the transmitter begins transmitting as soon as FSX goes active 
until the number of specified bits has been shifted out. In the variable data 
rate mode, when the FSX status changes prior to all the data bits being shifted 
out, the transmission completes and the DX pin is placed in a high impedance 
state. An FSR input is exactly complementary to the FSX. 

When using an external FSX, if DXR and XSR are empty, a write to DXR re- 
sults in a DXR to XSR transfer. This data is held in the XSR until an FSX oc- 
curs. When the external FSX is received, the XSR begins shifting the data. If 
XSR is waiting for the external FSX, a write to DXR will change DXR, but a 
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DXR to XSR transfer will not occur. XSR begins shifting when the external 
FSX is received, or when reset using XRESET. 

Continuous Transmit and Receive Modes 

When continuous mode is chosen, consecutive writes do not generate or ex- 
pect new sync pulse signalling. Only the first word of a block begins with an 
active synchronization. Thereafter, data continues to be transmitted as long 
as new data is loaded into DXR before the last word has been transmitted. 
As soon as TXRDY is active and all of the data has been transmitted out of the 
shift register, the DX pin is placed in a high impedance state, and a subsequent 
write to DXR initiates a new block and a new FSX. 

Similarly with FSR, the receiver continues shifting in new data and loading 
DRR. If the data receive buffer is not read before the next word is shifted in, 
subsequent incoming data will be lost. The RFSM bit can be used to termi- 
nate the receive continuous mode. 
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Handshake Mode 

The handshake mode (HS = 1) allows for direct connection between proces- 
sors. In this mode, all data words are transmitted with a leading 1 (see Figure 
9-19). For example, if an 8-bit word is to be transmitted, the first bit sent is 
a 1 , followed by the 8-bit data word. 

In this mode, once the serial port transmits a word, it will not transmit another 
word until it receives a separately transmitted zero bit. Therefore, the 1 bit that 
precedes every data word is, in effect, a request bit. 



. data word 

r (8-b!ts) 



DX- 



-(Dcooooooa 

V 

>- leading one 
Figure 9-19. Data Word Format in Handshake Mode 



After a serial port receives a word (with the leading 1 ), and it has been read 
from the DRR, it sends a single to the transmitting serial port. Thus, the 
single bit acts as an acknowledge bit (see Figure 9-20). This single ac- 
knowledge bit is sent every time the DRR is read, even if the DRR does not 
contain new data. 




DX- 



<I> 



■single zero 
Figure 9-20. Single Zero Sent as an Acknowledge 



When the serial port is placed in the handshake mode, the insertion and de- 
letion of a leading 1 for transmitted data, the sending of a for acknowl- 
edgement of received data, and the waiting for this acknowledge bit are all 
performed automatically. Using this scheme, it is simple to connect processors 
with no external hardware and guarantee secure communication. A typical 
configuration is shown in Figure 9-21 . 

In the handshake mode, FSX is automatically configured as an output. Con- 
tinuous mode is automatically disabled. After a system reset or XRESET, the 
transmitter is always permitted to transmit. The transmitter and receiver must 
be reset when entering the handshake mode. 
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Figure 9-21. Direct Connection Using Handshalte Mode 




9.2.11 Serial Port Interrupt Sources 

A serial port has four interrupt sources: 

1 ) The transmit timer interrupt: The rising edge of XTSTAT causes a single 
cycle interrupt pulse to occur. When XTINT is 0, this interrupt pulse is 
disabled. 

2) The receive timer interrupt: The rising edge of RTSTAT causes a single 
cycle interrupt pulse to occur. When RTINT is 0, this interrupt pulse is 
disabled. 

3) The transmitter interrupt: Occurs immediately following a DXR to XSR 
transfer. The transmitter interrupt is a single cycle pulse. When the 
global serial-port control register XINT is 0, this interrupt pulse is disa- 
bled. 

4) The receiver interrupt: Occurs immediately following a RSR to DRR 
transfer. The receiver interrupt is a single cycle pulse. When the global 
serial-port control register RINT is 0, this interrupt pulse is disabled. 

The transmit timer interrupt pulse is ORed with the transmitter interrupt pulse 
to create the CPU transmit interrupt flag XINT. The receive timer interrupt 
pulse is ORed with the receiver interrupt pulse to create the CPU receive in- 
terrupt flag RINT. 

9.2.12 Serial Port Functional Operation 

The following paragraphs and figures illustrate the functional timing of the 
various serial port modes of operation. The timing descriptions are presented 
assuming that all signal polarities are configured to be positive, i.e. 
CLKXP=CLKRP= DXP=DRP=FSXP=FSRP=0. Logical timing, in situations 
where one or more of these polarities are inverted, is the same but with respect 
to the opposite polarity reference points, i.e. rising vs. falling edges, etc. 

These discussions pertain to the numerous operating modes and configura- 
tions of the serial port logic. When it is necessary to switch operating modes 
or chan ge co nfigurati ons of the serial port, this should be done only when 
XRESET or or RRESET are asserted (l ow) as appropriate. Therefore, when 
transmit configurations are modif ied, XRE SET should be low, and when re- 
ceive configurations are modified, RRESET should be low. When in handshake 
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mode, however, since the transmitter a nd recei ver a re interr elated, any con- 
figuration changes should be made with XRESET and RRESET both low. 

All of the various serial port operating configurations can be broadly classified 
in two categories: fixed data rate timing and variable data rate timing. The 
following paragraphs discuss fixed and variable data rate operation and all of 
their variations. 

Fixed Data Rate Timing Operation 

Fixed data rate serial port transfers can occur in two varieties: burst mode and 
continuous mode. In burst mode operation, transfers of single words are se- 
parated by periods of inactivity on the serial port. In continuous mode, there 
are no gaps between successive word tranfers, i.e., the first bit of a new word 
is transferred on the next CLKX/R pulse following the last bit of the previous 
word. This occurs continuously until the process is terminated. 

In burst mode with fixed data rate timing, FSX/FSR pulses initiate transfers, 
and each transfer involves a single word. With an internally generated FSX 
(see Figure 9-22), transmission is initiated by loading DXR. In this mode, 
there is an approximately 2.5 CLKX cycle delay (depending on CLKX and HI 
frequencies) from DXR being loaded until FSX occurs. With an external FSX, 
the FSX pulse initiates the transfer and the 2.5 cycle delay effectively becomes 
a setup requirement for loading DXR with respect to FSX. Therefore, in this 
case, DXR must be loaded no later than 3 CLKX cycles before FSX occurs. 
Once the XSR is loaded from the DXR, an XINT is generated. 



CLKX/R 

FSR/FSX 
(EXTERNAL) "" 

FSX 

(INTERNAL) 

DX/DR — 




--r-^ocj^ziy^'o- 



DXR 
LOADED 



Figure 9-22. Fixed Burst Mode 



In receive operations, once a transfer is initiated, FSR is ignored until the last 
bit. For burst mode transfers, FSR must be low during the last bit, or another 
transfer will be initiated. After a full word has been received and transfered to 
the DRR, an RINT is generated. 

In fixed data rate mode, continuous transfers may be performed even if 
R/XFSM=0 as long as properly timed frame synchronization is provided, or if 
DXR is reloaded each cycle (with an internally generated FSX), see Figure 
9-23. 
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XINT 
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XINT 




DXR 


RINT 
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LOAD DXR 


LOAD DXR 




READ DRR 


READ DRR 



Figure 9-23. Fixed Continuous Mode With Frame Synch 



For receive operations and with externally generated FSX, once transfers have 
begun, frame sync pulses are only required during the last bit transferred to 
initiate another contiguous transfer. Otherwise, frame sync inputs are ignored. 
Therefore, continuous transfers will occur if frame sync is held high. With an 
internally generated FSX, there is an approximately 2.5 CLKX cycle delay fol- 
lowing DXR being loaded before FSX occurs. This delay occurs each time 
DXR is loaded, therefore, during continuous transmission, the instruction 
which loads DXR must be executed by the N-3 bit, (for an N-bit trans- 
mission). Since delays due to pipelining may vary, a conservative margin of 
safety should be incorporated in accounting for this delay. 

Once the process begins, an XINT and an RINT are generated at the beginning 
of each transfer. The XINT indicates that the XSR has been loaded from DXR, 
and can be used to cause DXR to be reloaded. To maintain continuous trans- 
mission in this mode, especially with an interally generated FSX, DXR must 
be reloaded early in the ongoing transfer. 

The RINT indicates that a full word has been received and transferred into the 
DRR. RINT is therefore commonly used to indicate an appropriate time to read 
DRR. 

Continuous transfers are terminated by discontinuing frame sync pulses or, in 
the case of internally generated FSX, not reloading DXR. 

Continuous serial port transfers can be accomplished without the use of frame 
sync pulses if R/XFSM are set to one. In this mode, operation of the serial 
port is similar to continuous operation with frame sync except that a frame 
sync pulse is involved only in the first word transferred, and no further frame 
sync pulses are used. Following the first word transferred (see Figure 9-24), 
no internal frame sync pulses are generated, and frame sync inputs are ig- 
nored. Additionally, R/XFSM should be set prior to or during the first word 
transferred, and must be set no later than the transfer of the N-1 bit of the first 
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word, except for transmit operations. For transmit operations in the fixed data 
rate mode, XFSM must be set no later than the N-2 bit. Clearing R/XFSM 
must be performed no later than the N-1 bit to be recognized in the current 
cycle. 



CLKX/R 

FSR/FSX 
(EXTERNAL) 

FSX 

(INTERNAL) 

DR/DX 




■-^ r^r^ l — x-^fT^r-BT-y ^l — y-BN-x -ci-^r- 



DXR 
LOADED 



T 


f 
SET 


XINT 




f 
XINT 




R/XFSM 


RINT 




RINT 


DXR 


LOAD DXR 


LOAD DXR 


LOA 


3ED 


READ DRR 


READ DRR 



Figure 9-24. Fixed Continuous Mode Without Frame Synch 



Timing of RINT and XINT and data transfers to and from DXR and DRR, re- 
spectively, are the same as in fixed data rate continuous mode with frame sync. 
This mode of operation also exhibits the same 2.5 CLKX cycle delay following 
DXR being loaded before an internal FSX is generated. As in the case of 
continuous operation in fixed data rate mode with frame sync, DXR must be 
reloaded no later than transmission of the N-3 bit. 

When using continuous operation in fixed data rate mode, R/XFSM may be 
set and cleared as desired, even during active transfers, to enable or disable the 
use of frame sync pulses as dictated by system requirements. Under most 
conditions, the effect of changing the state of R/XFSM occurs during the 
transfer in which the R/XFSM change was made, provided the change was 
made early enough in the transfer. For transmit operations with internal FSX 
in fixed data rate mode, however, a one word delay occurs before frame sync 
pulse generation resumes when clearing XFSM to zero (see Figure 9-25). 
Therefore, one additional word is transferred in this case before the next FSX 
pulse is generated. Also note that, as discussed previously, clearing XFSM 
will be recognized during the current word being transmitted as long as XFSM 
is cleared no later than the N-1 bit. Setting XFSM is recognized as long as 
XFSM is set no later than the N-2 bit. 
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Figure 9-25. Exiting Fixed Continuous Mode Without Frame Synch, FSX Internal 



Variable Data Rate Timing Operation 

Variable data rate timing also supports operation in either burst or continuous 
mode. Burst mode operation with variable data rate timing is similar to burst 
mode operation with fixed data rate timing. With variable data rate timing (see 
Figure 9-26) however, FSX/R and data timing differs slightly at the beginning 
and end of transfers. Specifically, there are three major differences between 
fixed and variable data rate timing. 




FSR/FSX 
(EXTERNAL) , 

FSX 

(INTERNAL) ■ 



-rmc^} y--Air> - 



DXR 
LOADED 



Figure 9-26. Variable Burst Mode 



First, FSX/R pulses typically last for the entire transfer interval, although FSR 
and external FSX are ignored after the first bit transferred. FSX/R pulses in 
fixed data rate mode typically last only one CLKX/R cycle, but can last longer. 

Second, data transfer begins during the CLKX/R cycle in which FSX/R occurs, 
rather than the CLKX/R cycle following FSX/R, as is the case with fixed data 
rate timing. 
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Finally, with variable data rate timing, frame sync inputs are ignored until the 
end of the last bit transferred, rather than the beginning of the last bit trans- 
ferred as is the case with fixed data rate timing. 

When transmitting continuously in variable data rate mode with frame sync, 
timing is the same as for fixed data rate mode, besides the differences between 
these two modes as described under burst mode operation with variable data 
rate timing. The only exception to this is that when operating continuously 
in variable data rate mode (see Figure 9-27), DXR must be reloaded no later 
than the N-4 bit to maintain continuous operation, as opposed to the N-3 bit 
for fixed data rate mode. 



CLKX/R ■ 

FSR/FSX 
(EXTERNAL) . 

FSX 
(INTERNAL). 

DX/DR ' 



-<-^nc^i y-^N- y-Bi-Y ^^ — y-^u-y -on r-cr^ c— 



DXR 
LOADED 



LOAD 
DXR 



XINT 
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Figure 9-27. Variable Continuous Mode With Frame Synch 



Continuous operation in variable data rate mode without frame sync is also 
similar to continuous operation without frame sync in fixed data rale mode. 
As with variable data rate mode continuous operation with frame sync (see 
Figure 9-28), DXR must be reloaded no later than the N-4 bit to maintain 
continuous operation. Additionally, when R/XFSM is set or cleared in the 
variable data rate mode, the modification must be made no later than the N-1 
bit, for the result to be affected in the current transfer. 




9-31 



Peripherals - Serial Ports 



CLKX/R 

FSR/FSX 

(EXTERNAL) 

FSX 

(INTERNAL) 



•<KXt 



yt=yc 



•^ r-ci-x-c2-Y - 



DXR 
LOADED 



XINT I SET 

DXR R/XFSM 
LOADED 



XINT 

RINT 

LOAD DXR 
READ DRR 



XINT 

RINT 

LOAD DXR 
READ DRR 



Figure 9-28. Variable Continuous Mode Without Frame Synch 
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9.3 DMA Controller 



The TMS320C30 provides an on-chip Direct Memory Access (DMA) con- 
troller. The purpose of the DMA controller is to reduce the need for the CPU 
to perform input/output functions. The DMA controller can perform 
input/output operations without interfering with the operation of the CPU. 
Therefore, it is possible to interface the TMS320C30 to slow external memo- 
ries and peripherals (A/D's, serial ports, etc.) without reducing the computa- 
tional throughput of the CPU. The result is improved system performance and 
decreased system cost. 

A DMA transfer consists of two operations: a read from a memory location and 
a write to a memory location. The DMA controller can read from and write to 
any location in the TMS320C30 memory map. This includes all memory- 
mapped peripherals. The operation of the DMA is controlled with the fol- 
lowing set of memory-mapped registers: 

• DMA global control register 

• DMA source address register 

• DMA destination address register 
© DMA transfer counter register 

These registers, their memory-mapped addresses, and functions are shown in 
Figure 9-29. Each of these DMA registers will be discussed in the succeeding 
subsections. 



Register 


Peripheral 
Address 


DMA GLOBAL CONTROL 


808000h 


RESERVED 


808001 h 


RESERVED 


808002h 


RESERVED 


808003h 


DMA SOURCE ADDRESS 


808004h 


RESERVED 


808005h 


DMA DESTINATION ADDRESS 


808006h 


RESERVED 


808007h 


DMA TRANSFER COUNTER 


808008h 


RESERVED 


808009h 


RESERVED 


80800Ah 


RESERVED 


80800Bh 


RESERVED 


80800Ch 


RESERVED 


80800Dh 


RESERVED 


80800Eh 


RESERVED 


80800Fh 




Figure 9-29. IVIemory-Mapped Locations for a DMA Channel 
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9.3.1 DMA Global Control Register 

The global control register controls the state in which the DMA controller 
operates. This register also indicates the status of the DMA, which changes 
every cycle. Source and destination addresses can be incrennented, decre- 
mented, or synchronized using specified global control register bits. At system 
reset, all bits in the DMA control register are set to 0. Table 9-7 lists the reg- 
ister bits, names, and functions. Figure 9-30 shows the bit configuration of 
the global control register. 



Table 9-7. Global Control Register Bits 



BIT 


NAME 


FUNCTION 


0-1 


START 


These bits control the state in which the DMA starts and stops. 
The DMA may be stopped without any loss of data (see Table 9-8). 


2-3 


STAT 


These bits indicate the status of the DMA. These status bits change 
every cycle (see Table 9-9). 


4 


INCSRC 


If INCSRC = 1, the source address is incremented after every read. 


5 


DECSRC 


If DECSRC = 1, the source address is decremented after every read. 
If INCSRC = DECSRC, the source address is not modified after a 
read. 


6 


INCDST 


If INCDST = 1, the destination address is incremented after every 
write. 


7 


DECDST 


If DECDST = 1, the destination address is decremented after every 
read. If INCDST = DECDST, the destination address is not modi- 
fied after a write. 


8-9 


SYNCH 


The SYNCH bits determine the timing synchronization between the 
events initiating the source and the destination transfers. The in- 
terpretation of the SYNCH bits is shown in Table 9-10. 


10 


TC 


The TC bit affects the operation of the transfer counter. If TC = 0, 
transfers are not terminated when the transfer counter becomes 
zero. If TC = 1, transfers are terminated when the transfer counter 
becomes zero. 


11 


TCI NT 


If TCINT = 1,the DMA interrupt is set when the transfer counter 
makes a transition to zero. If TCINT = 0, the DMA interrupt is not 
set when the transfer counter makes a transition to zero. 


12 


Reserved 


Read as zero. 



31 


30 


29 


28 


27 


26 


25 


24 


23 


22 


21 


20 


19 


18 


17 


16 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX XX 


XX 


15 14 13 12 11 


10 


9 8 


7 


6 


5 


4 


3 2 10 


XX 


XX 


XX 


XX 


TCINT 


TC 


SYNCH 


DECDST 


INCDST 


DECSRC 


INCSRC 


STAT 


START 



R/W R/W R/W R/W R/W 



R/W 



R/W 



R/W R R R/W R/W 



NOTE: XX - Reserved bit, read as 0. 
R = read, W = write. 



Figure 9-30. DMA Global Control Register 
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Table 9-8. START Bits and Operation of the DMA 



START 


FUNCTION 


00 


DMA read or write cycles in progress will be completed, any data read will 
be ignored. Any pending read or write will be cancelled. The DMA is reset 
so that when started, a new transaction is begun; i.e., a read is performed. 


01 


If a read or write has begun, the read or write is completed before stopping, 
i.e. in the middle or at the end of a DMA transfer. If a read or write has 
not begun, no read or write is started. 


1 


If a DMA transfer has begun, the entire transfer is completed (including 
both read and write operations) before stopping. If a transfer has not be- 
gun, none is started. 


1 1 


DMA starts from reset or restarts from the previous state. 



Table 9-9. STAT Bits and Status of the DMA 



STAT 


FUNCTION 


00 


DMA is being held between DMA transfers (between a 
This is the value at reset. 


read and write). 


01 


DMA is being held in the middle of a DMA transfer, i.e. 
and a write. 


between a read 


1 


Reserved. 


1 1 


DMA busy; i.e., DMA is performing a read or write. 



Table 9-10. SYNCH Bits and Synchronization of the DMA 



SYNCH 


FUNCTION 


00 


No synchronization. Enabled interrupts are ignored. 


01 


Source synchronization. A read is performed when an enabled interrupt 
occurs. 


1 


Destination synchronization. A write is performed when an enabled inter- 
rupt occurs. 


1 1 


Source and destination synchronization. A read is performed when an en- 
abled interrupt occurs. A write is then performed when the next enabled 
interrupt occurs. 
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9.3.2 Destination and Source Address Registers 

The DMA destination and source address registers are 24-bit registers. These 
registers are used when performing the increment and decrement as specified 
by control bits DECSRC, INCSRC, DECDST, and INCDST of the DMA global 
control register. The contents of these registers specify the destination and 
source addresses. The registers are incremented or decremented at the end 
of the corresponding memory access. I.e., source register for a read, destina- 
tion register for a write. On system reset, is written to these registers. 

9.3.3 Transfer Counter Register 

The transfer counter register is a 24-bit register, controlled by a 24-bit counter 
that counts down. The counter decrements upon the completion of a DMA 
memory write. In this way, it can be used to control the size of a block of data 
transferred. The transfer counter register is set to at system reset. 

9.3.4 CPU/DMA Interrupt Enable Register 

The CPU/DMA interrupt enable register (IE) is a 32-bit register located in the 
CPU register file. The CPU interrupt enable bits are in locations 10-0. The 
DMA interrupt enable bits are in locations 26-1 6. A 1 in a CPU/DMA interrupt 
enable register bit enables the corresponding interrupt. A disables the cor- 
responding interrupt. At reset, is written to this register. 

Table 9-11 list the bits, names, and functions of the CPU/DMA interrupt en- 
able register. Figure 9-31 shows the IE register. The priority and decoding 
scheme of CPU and DMA interrupts is identical. Note that when the DMA 
receives an interrupt, this interrupt is acted upon based upon the SYNCH field 
of the DMA control register. Note that an interrupt may affect the DMA, but 
not the CPU and vice versa. Refer to Section 7. 
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Table 9-11. CPU/DMA Interrupt Enable Register Bits 



BIT 


NAME 


FUNCTION 





EINTO 


Enable external interrupt (CPU) 


1 


EINT1 


Enable external interrupt 1 (CPU) 


2 


EINT2 


Enable external interrupt 2 (CPU) 


3 


EINT3 


Enable external interrupt 3 (CPU) 


4 


EXINTO 


Enable serial port transmit interrupt (CPU) 


5 


ERINTO 


Enable serial port receive interrupt (CPU) 


6 


EXINT1 


Enable serial port 1 transmit interrupt (CPU) 


7 


ERINT1 


Enable serial port 1 receive interrupt (CPU) 


8 


ETINTO 


Enable timer interrupt (CPU) 


9 


ETINT1 


Enable timer 1 interrupt (CPU) 


10 


EDINT 


Enable DMA controller interrupt (CPU) 


11-15 


Reserved 


Read as 


16 


EINTO 


Enable external interrupt (DMA) 


17 


EINTI 


Enable external interrupt 1 (DMA) 


18 


EINT2 


Enable external interrupt 2 (DMA) 


19 


EINT3 


Enable external interrupt 3 (DMA) 


20 


EXINTO 


Enable serial port transmit interrupt (DMA) 


21 


ERINTO 


Enable serial port receive interrupt (DMA) 


22 


EX1NT1 


Enable serial port 1 transmit interrupt (DMA) 


23 


ERINT1 


Enable serial port 1 receive interrupt (DMA) 


24 


ETINTO 


Enable timer interrupt (DMA) 


25 


ETINT1 


Enable timer 1 interrupt (DMA) 


26 


EDINT 


Enable DMA controller interrupt (DMA) 


27-32 


Reserved 


Read as 



31 


30 29 28 27 


26 


25 


24 


23 


22 


21 


20 


19 


18 


17 


16 


XX 


XX 


XX 


XX 


XX 


EDINTO 
(DMA) 


ETINT1 
(DMA) 


ETINTO 
(DMA) 


ERINT1 
(DMA) 


EXINT1 
(DMA) 


ERINTO 
(DMA) 


EXINTO 
(DMA) 


E!NT3 
(DMA) 


EINT2 
(DMA) 


EINTI 
(DMA) 


EINTO 
(DMA) 


R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W 
1514131211 10 987654321 


R/W 



XX 


XX 


XX 


XX 


XX 


EDINTO 
(CPU) 


ETINT1 
(CPU) 


ETINTO 
(CPU) 


ERINTI 
(CPU) 


EXINT1 
(CPU) 


ERINTO 
(CPU) 


EXINTO 
(CPU) 


EINT3 
(CPU) 


EINT2 
(CPU) 


EINTI 
(CPU) 


EINTO 
(CPU) 



R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W 



NOTE: XX = Reserved bit, read as 0. 
R = read, W = write. 



Figure 9-31. CPU/DMA Interrupt Enable Register 
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9.3.5 DMA Memory Transfer Operation 

Each DMA memory transfer consists of two parts: 

1 ) Read data from the address specified by the DMA source register. 

2) Write data that has been read to the address specified by the DMA des- 
tination register. 

A transfer is complete only when the read and write are complete. A transfer 
may be stopped by setting the START bits to the desired value. When the 
DMA is restarted (START = 1 1), it completes any pending transfer. 

At the end of a DMA read, the source address is modified as specified by the 
SRCINC and SRCDEC bits of the DMA global control register. At the end of 
a DMA write, the destination address is modified as specified by the DSTINC 
and DSTDEC bits of the DMA global control register. At the end of every 
DMA write, the DMA transfer counter is decremented. 

DMA on-chip reads and writes (reads and writes from on-chip memory and 
peripherals) are single cycle. DMA off-chip reads are two cycles. The first 
cycle is an internal setup with the external read beginning on the following 
cycle. The external read cycle is identifical to a CPU read cycle. DMA off-chip 
writes are identical to CPU off-chip writes. 

Through the 24-bit source and destination registers, the DMA is capable of 
accessing any memory-mapped location in the TMS320C30 memory map. 
Figure 9-32 through Figure 9-34 show the number of cycles a DMA transfer 
requires, depending upon whether the source and destination are on-chip 
memory and peripherals, the external port, or the I/O port. T represents the 
number of transfers to be performed. C, represents the number of wait states 
for the source read. C^ represents the number of wait states for the destination 
write. Each entry in the table represents the total cycles required to do the T 
transfers, assuming no pipeline conflicts. 

Accompanying each table is a figure illustrating the timing of the DMA trans- 
fer. |R| and |W| represent single-cycle reads and writes, respectively. |R.R1 
and |W.W| represent multicycle reads and writes. |Cr| and |Cw| show the 
number of wait cycles for a read and write. |-| represents the cycle used as 
an internal setup for DMA external reads. 
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Figure 9-32. Timing and Number of Cycles for DMA Transfers When Destination 

is On-Chip 
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Figure 9-33. DMA Timing When Destination is a Primary Bus 
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Figure 9-34. DMA Timing When Destination is an Expansion Bus 
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Table 9-12 shows the maximum DMA transfer-rates assuming no wait states 
(Cr=Cw= 0). Table 9-13 shows the maximum DMA transfer- rates assuming 
one wait state for the read (Cr=1) and no wait states for the write (Cw^O). 
Table 9-14 shows the maximum DMA transfer- rates assuming one wait state 
for the read (Q =1 ) and one wait state for the write (Cw ). 

In each table, the complete transfer is considered (i.e., the time to do the read 
and the write). Since one bus access is required for the read and another for 
the write, bus transfer- rates will be twice the transfer-rate. It is also assumed 
that no conflicts with the CPU exist. 

Table 9-12. Maximum DMA Transfer Rates When C/- = C^ = 



SOURCE 


INTERNAL 


DESTINATION 
PRIMARY 


EXPANSION 


INTERNAL 


33.3 Mbytes/sec 


33.3 Mbytes/sec 


33.3 Mbytes/sec 


PRIMARY 


22.2 Mbytes/sec 


16.7 Mbytes/sec 


22.2 Mbytes/sec 


EXPANSION 


22.2 Mbytes/sec 


22.2 Mbytes/sec 


16.7 Mbytes/sec 



Table 9-13. Maximum DMA Transfer Rates When Q = 1, Cn, = 



SOURCE 


INTERNAL 


DESTINATION 
PRIMARY 


EXPANSION 


INTERNAL 


33.3 Mbytes/sec 


33.3 Mbytes/sec 


33.3 Mbytes/sec 


PRIMARY 


16.7 Mbytes/sec 


1 3.3 Mbytes/sec 


1 6.7 Mbytes/sec 


EXPANSION 


16.7 Mbytes/sec 


16.7 Mbytes/sec 


13.3 Mbytes/sec 



Table 9-14. Maximum DMA Transfer Rates When C,- = 1, C^ = 1 



SOURCE 


INTERNAL 


DESTINATION 
PRIMARY 


EXPANSION 


INTERNAL 


33.3 Mbytes/sec 


22.2 Mbytes/sec 


22.2 Mbytes/sec 


PRIMARY 


16.7 Mbytes/sec 


11.1 Mbytes/sec 


16.7 Mbytes/sec 


EXPANSION 


16.7 Mbytes/sec 


16.7 Mbytes/sec 


11.1 Mbytes/sec 
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9.3.6 Synchronization of DMA Channels 

A DMA channel may be synchronized through the use of interrupts. Refer to 
Table 9-10 for the relationship between the SYNCH bits of the DMA global 
control register and the synchronization performed. This section describes the 
following four synchronization mechanisms: 

• No synchronization (SYNCH =0 0) 

• Source synchronization (SYNCH =01) 

• Destination synchronization (SYNCH = 10) 

• Source and destination synchronization (SYNCH. = 1 1) 
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No Synchronization 

When SYNCH = 0, no synchronization is performed. The DMA will perfornn reads and 
writes whenever there are no conflicts. All interrupts are ignored, and therefore can be 
considered to be globally disabled. However, no bits in the DMA interrupt enable register 
are changed. Figure 9-35 shows the synchronization mechanism when SYNCH =0 0. 



START 



DISABLE DMA INTERRUPTS GLOBALLY 



I 



DMA CHANNEL PERFORMS A READ 



DMA CHANNEL PERFORMS A WRITE 



GO TO START 



Figure 9-35. No DMA Synchronization 
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Source Synchronization 

When SYNCH = 1, the DMA is synchronized to the source (see Figure 9-36). A read 
will not be performed until an interrupt is received by the DMA. Then, all DMA interrupts 
are disabled globally. However, no bits in the DMA interrupt enable register are changed. 



START 

in 



IDLE UNTIL ENABLED INTERRUPT IS RECEIVED 



I 



DISABLE DMA INTERRUPTS GLOBALLY 



DMA CHANNEL PERFORMS A READ 



DMA CHANNEL PERFORMS A WRITE 



ENABLE DMA INTERRUPTS GLOBALLY 



GO TO START 



Figure 9-36. DMA Source Synchronization 
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Destination Synchronization 

When SYNCH = 1 0, the DMA is synchronized to the destination. First, all interrupts are 
ignored until the read is complete. Though the DMA interrupts may be considered to be 
globally disabled, no bits in the DMA interrupt enable register are changed. A write will 
not be performed until an interrupt is received by the DMA. Figure 9-37 shows the syn- 
chronization mechanism when SYNCH = 10. 



START 



DMA INTERRUPTS ARE DISABLED GLOBALLY 

i 



DMA CHANNEL PERFORMS A READ 



J 



DMA INTERRUPTS ARE ENABLED GLOBALLY 



IDLE UNTIL ENABLED INTERRUPT IS RECEIVED 



DISABLE DMA INTERRUPTS GLOBALLY 



DMA CHANNEL PERFORMS A WRITE 



GO TO START 



Figure 9-37. DiVIA Destination Synchronization 
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Source and Destination Synchronization 

When SYNCH = 1 1, all interrupts are ignored, and therefore can be considered to be 
globally disabled. However, no bits in the DMA interrupt enable register are changed. A 
read is performed when an interrupt is received. A write is performed on the following 
interrupt. Source and destination synchronization when SYNCH = 1 1 is shown in Figure 
9-38. 



START 



IDLE UNTIL ENABLED INTERRUPT IS RECEIVED 



I 



DISABLE DMA INTERRUPTS GLOBALLY 



DMA CHANNEL PERFORMS A READ 



ENABLE DMA INTERRUPTS GLOBALLY 



IDLE UNTIL ENABLED INTERRUPT IS RECEIVED 



DISABLE DMA INTERRUPTS GLOBALLY 



DMA CHANNEL PERFORMS A WRITE 



ENABLE DMA INTERRUPT GLOBALLY 



GO TO START 



Figure 9-38. DIVIA Source and Destination Synchronization 
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Section 10 

Pipeline Operation 



TMS320C30 operation is controlled by five major functional units: fetch, de- 
code, read, execute, and DMA. To provide for maximum processor through- 
put, these units can perform in parallel, with each unit operating on a different 
instruction. The overlapping of the fetch, decode, read, and execute oper- 
ations of different instructions is called pipelining. The pipelining of these 
operations results in the high performance of the TMS320C30. The ability of 
the DMA to move data within the processor memory space results in an even 
greater utilization of the CPU with fewer interruptions of the pipeline, thus 
yielding greater performance. 

Major topics discussed in this section are as follows: 
© Pipeline Structure (Section 10.1 on page 10-2) 

9 Pipeline Conflicts (Section 10.2 on page 10-4) 

- Branch conflicts 

- Register conflicts 

- Memory conflicts 

« Resolving Memory Conflicts (Section 10.3 on page 10-14) 

9 Clocking of Memory Accesses (Section 10.4 on page 10-16) 

- Program fetches 

- Data loads and stores 

- DMA accesses 
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10.1 Pipeline Structure 



The five major units of the TMS320C30 pipeline structure and their function 
are as follows: 



Fetch Unit (F) 
Decode Unit (D) 



Read Unit (R) 
Execute Unit (E) 



Fetches the instruction words from memory and 
updates the program counter (PC). 

Decodes the instruction word and performs ad- 
dress generation. Any modification of the auxiliary 
registers and the stack pointer is controlled by this 
unit. 

If required, reads the operands from memory. 

If required, reads the operands from the register 
file, performs the necessary operation, and if 
needed writes results to the register file. If re- 
quired, results of previous operations are written 
to memory. 

DMA Channel (DMA) Reads and writes memory. 

The basic instruction has four levels: fetch, decode, read, and execute. Figure 
10-1 illustrates these four levels of the pipeline structure. The levels are in- 
dexed according to instruction and execution cycle. Also indicated is a place 
in the pipeline where all four units operate in parallel; the perfect overlap oc- 
curs at cycle {m). Those levels about to be executed are at m+'\ , and those 
just executed are at m-^ . The TMS320C30 pipeline control allows for an ex- 
tremely high-speed execution rate by allowing an effective rate of one exe- 
cution per cycle. It also manages pipeline conflicts in a way that makes them 
transparent to the user. The user does not need to take any special precautions 
to guarantee correct operation. 



INSTRUCTION 



CYCLE 

I m-3 I m-2 I m-1 I m 



I m + 1 I m+2 I 



I I F I D I R I E I 

J I F I D I R I E 

K I F I D I R 

J I F I D 

t 
PERFECT 
OVERLAP 

Figure 10-1. TMS320C30 Pipeline Structure 



E I 
R I E 
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Priorities have been assigned to each of the functional units. The priorities 
from highest to lowest are: 

O Execute (highest) 

• Read 

• Decode 

• Fetch 

• DMA (lowest). 

When processing of an instruction is ready to pass to the next higher pipeline 
level, but that level is not ready to accept a new input, a pipeline conflict oc- 
curs, in this case, the lower priority unit waits until the higher priority unit 
completes its currently executing function. 

Despite the DMA controllers low priority, conflicts with the CPU can be min- 
imized or even eliminated by suitable data structuring since the DMA con- 
troller has its own data and address buses. 
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10.2 Pipeline Conflicts 



The pipeline conflicts of the TMS320C30 can be grouped into the following 
main categories: 

Branch Conflicts Involve most of those instructions or operations which 
read and/or modify the PC. 

Register Conflicts Involve delays that can occur when reading or writing 
registers used for address generation. 

Memory Conflicts Occur when the internal units of the TMS320C30 
compete for memory resources. 

Each of these three types is discussed in the following sections. Examples are 
included. Note in these examples, when data is refetched or an operation is 
repeated, the symbol representing the stage of the pipeline is appended with 
a number. For example, if a fetch is performed again, the initial fetch is labeled 
F1 and the refetch is labeled F2. When an access is detained multiple cycles 
due to a 'not ready,' the symbols RDY and RDY are used to indicate not ready 
and ready, respectively. 



10.2.1 Branch Conflicts 




The first class of pipeline conflicts is that which occurs with standard (non- 
delayed) branches, i.e., BR, Bcond, DBcond, CALL, IDLE, RPTB, RPTS, 
REJlcond, RETScond, interrupts, and reset. Conflicts arise with these in- 
structions and operations since during their execution, the pipeline is used 
only for the completion of the operation; other information fetched into the 
pipeline is discarded or refetched, or the pipeline is inactive. This is referred 
to as flushing the pipeline. Flushing the pipeline is necessary in these cases 
to guarantee that portions of succeeding instructions do not inadvertantly get 
partially executed. JRAPcond and CALLcond are classified somewhat differ- 
ently from the other types of branches and are considered later. 

Example 10-1 shows the code and pipeline operation for a standard branch. 
Note that one dummy fetch is performed (F1 ), and then after the branch ad- 
dress is available, a new fetch (F2) is performed. This dummy fetch will affect 
the cache. 



10-4 



Pipeline Operation - Pipeline Conflicts 



Example 10-1. Standard Branch 

BR THREE 

MPYF 

ADDF 

SUBF 

AND 



Unconditional branch 
Not executed 
Not executed 
Not executed 
Not executed 



THREE OR 
STI 



; Fetched after BR is fetched 



PIPELINE OPERATION 



BR THREE 

OR 

STI 



THREE-*PC 
D I R I E I 

F1 I (nop) I (nop) | F2 | 



RPTS and RPTB both flush the pipeline, thus allowing for the RS, RE, and 
RC registers to be loaded at the proper time relative to the flow of the pipeline. 
If these registers are loaded without the use of RPTS or RPTB, no flushing of 
the pipeline occurs. If none of the repeat modes are being used, RS, RE, and 
RC may be used as general-purpose 32-bit registers without any pipeline 
conflicts occurring. In cases such as the nesting of RPTB due to nested in- 
terrupts, it may be necessary to load and store these registers directly while 
using the repeat modes. Since up to four instructions can be fetched before 
entering the repeat mode, loads should be followed by a branch to flush the 
pipeline. If the RC is changing when an instruction is loading it, the direct load 
takes priority over the modification made by the repeat mode logic. 

Delayed branches are implemented to guarantee the fetching of the next three 
instructions. The delayed branches include BRD, BcondD, and DBcondD. 
Example 10-2 shows the code and pipeline operation for a delayed branch. 
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Example 10-2. Delayed Branch 



BRD 


THREE 


Unconditional delayed branch 


MPYF 




• Executed 


ADDF 




Executed 


SUBF 




Executed 


AND 




Not executed 


THREE MPYF 




• Fetched after SUBF fetched 



PIPELINE OPERATION 



BRD THREE 

MPYF 

ADDF 

SUBF 

MPYF 



D 


1 R 


THREE-»PC 
1 E 1 






F 


1 D 


1 R 


1 


E 


1 




1 F 


1 D 


1 


R 


1 E 






1 F 


1 
1 


D 
F 


1 R 
1 D 



10.2.2 Register Conflicts 




Register conflicts involve the reading or writing of registers used for address- 
ing purposes. These conflicts occur when the pertinent register is not ready 
to be used. The registers comprise the following three functional groups: 

Group 1 Auxiliary registers (AR0-AR7), index registers (IRO, IR1), and 
block size register (BK) 

Group 2 Data page pointer (DP) 

Group 3 System stack pointer (SP) 

If an instruction writes to one of these three groups, the use of any register 
within that particular group by the decode unit is delayed until the write is 
complete, i.e. instruction execution is completed. In Example 10-3, an auxil- 
iary register is loaded, and a different auxiliary register is used on the next in- 
struction. Since the decode stage needs the result of the write to the auxiliary 
register, the decode of this second instruction is delayed two cycles. Every 
time the decode is delayed, a refetch of the program word is performed; i.e., 
the first fetch of ADDF is at F1 , followed by F2 and F3 (the final fetch). Since 
these are actual refetches, they can cause conflicts with the DMA controller 
and cache hits and misses. 
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Example 10-3. Write to an AR Followed by an AR for Address Generation 

LDI 7,AR1 ; 7 -^ ARl 

NEXT MPYF *AR2,R0 ; Decode delayed 2 cycles 
ADDF 
FLOAT 



PIPELINE OPERATION 



LDI 7, ARl 1 


F 


1 D 


1 R 


7-*AR1 

1 E 1 






MPYF *AR2,R0 




1 F 


1 D1 


1 D2 1 


D3 


1 R 


ADDF 






1 F1 


1 F2 1 


F3 


1 D 


FLOAT 












1 F 



The case for reads of these groups is similar to the case for writes. If an in- 
struction must read a member of one of these groups, the use of that particular 
group by the decode for the following instruction is delayed until the read is 
complete. The registers are read at the start of the execute cycle and therefore 
require only a one cycle delay of the following decode. For four registers (IRO, 
IR1, BK, or DP) no delay is incurred. In all other cases, including the SP, the 
delay occurs. In Example 1 0-4, two auxiliary registers are added together with 
the result going to an extended-precision register. The next instruction uses 
a different auxiliary register as an address register. 

Example 10-4. A Read of ARs Followed by ARs for Address Generation 

ADDI AR0,AR1,R1 ; ARO + ARl -♦ Rl 
NEXT MPYF *++AR2,R0 ; Decode delayed 1 cycle 
ADDF 
FLOAT 



PIPELINE OPERATION 




ADDI 1 F 


1 D 


1 R 


AR0+AR1 

1 E 


-*R1 
1 






MPYF* +-(-AR2,R0 


1 F 


1 D1 


1 D2 


1 


R 


1 E 


ADDF 




1 F1 


1 F2 


1 


D 


1 R 


FLOAT 








1 


F 


1 D 



Note that while the DBR (decrement and branch) instruction does not use the 
auxiliary registers for addressing, its use of them as loop counters is treated 
as if it did. Therefore, the operation shown in the two previous examples can 
also occur for this instruction. 
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10.2.3 Memory Conflicts 



Possible memory conflicts occur when the memory bandwidth of a physical 
memory space is exceeded. For example, RAM blocks and 1 and the ROM 
block can support only two accesses every cycle. The external interface can 
support only one access per cycle. Some conditions under which memory 
conflicts can be easily avoided are discussed in Section 10.3. 

Memory pipeline conflicts consist of the following four types: 

Program Wait A program fetch is prevented from begin- 

ning. 

Program Fetch Incomplete A program fetch has begun, but is not yet 

complete. 

Execute Only An instruction sequence requires three 

CPU-data accesses in a single cycle. 

Hold Everything A primary or expansion bus operation must 

complete before another one can proceed. 

These four types of memory conflicts are discussed in the succeeding para- 
graphs and examples provided. 

Program Walt 

Two conditions can prevent the program fetch from beginning: 

• The start of a CPU-data access when: 

- Two CPU-data accesses are made to an internal RAM or ROM 
block, and a program fetch from the same block is necessary. 

- One of the external ports is starting a CPU-data access, and a 
program fetch from the same port is necessary. 

• A multicycle CPU-data access or DMA-data access over the external bus 
is needed. 
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An example of program wait until a CPU-data access completes is illustrated 
in Example 10-5. In this case, *ARO and *AR1 are both pointing to data in 
RAM block 0, and the MPYF instruction will be fetched from RAM block 0. 
This results in the conflict shown in Example 10-5. Since no more than two 
accesses can be made to RAM block in a single cycle, the program fetch 
cannot begin, and must wait until the CPU-data accesses are complete. 

Example 10-5. Program Wait Until CPU-Data Access Completes 

ADDF3 *AR0,*AR1,R0 

FIX 

MPYF 

ADDF3 

NEGB 



PIPELINE OPERATION 



ADDF3 *AR0,*AR1,R0 
FIX 

MPYF 
ADDF 
NEGB 



F I 



*ARO MEMORY READ 
•AR1 MEMORY READ 

D I R I E 



(wait) 
I F 



R I E I 
D I R I 
FID! 



Example 10-6 shows an example of a program wait due to a multicycle data- 
data access or a multicycle DMA access. The ADDF, MPYF, and SUBF are 
fetched from some portion in memory other than the external port the DMA 
requires. The DMA begins a multicycle access. The program fetch corre- 
sponding to the CALL is made to the same external port the DMA is using. 
Even though the DMA has the lowest priority, multi-cycle access cannot be 
aborted. The program fetch must therefore wait until the DMA access com- 
pletes. 

Example 10-6. Program Wait Due to Multicycle Access 

ADDF I F I D I R I E 




MPYF 
SUBF 

CALL 



D I R 

F I D 

(wait) 
I F 
2 cycle DMA access 



E I 

R I E 

F I D 
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Program Fetch Incomplete 

A program fetch incomplete occurs when a program fetch takes more than one 
cycle to complete due to wait states. In Example 10-7, the MPYF and ADDF 
are fetched from memory that supports single-cycle accesses. The SUBF is 
fetched from memory requiring one wait state. 

Example 10-7. Multicycle Program Memory Fetches 

MPYF I F I D I R I E | 

ADDF I F I D I R I E I 

RDY RDY 

SUBF I F I F I D I R I E I 

ADDI I F I D I R I 

Execute Only 

The Execute Only type of memory pipeline conflicts occurs when a sequence 
of instructions requires three CPU-data accesses in a single cycle or when 
performing an interlocked load. The three cases where this occurs are: 

• An instruction that performs a store, followed by an instruction that does 
two memory reads. 

• An instruction that performs two stores, followed by an instruction that 
performs at least one memory read. 

• An interlocked load (LDII or LDFI) instruction is performed, and XF1 = 
1. 
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The first case is shown in Example 10-8. Since this sequence requires three 
data memory accesses and only two are available, only the execute phase of 
the pipeline is allowed to proceed. The dual reads required by the LDF || LDF 
will be delayed one cycle. Note that a refetch of the next instruction can oc- 
cur. 



Example 10-8. Single Store Followed by Tvjo Reads 



STF 

LDF 

1 1 LDF 


R0,*AR1 
*AR2,R1 
*AR3,R2 


; RO -* *. 
; *AR2 -^ 
; *AR3 -* 


ARl 
Rl 
R2 


in 


parallel 


with 




PIPELINE OPERATION 












STF R0,*AR1 


1 F 1 


D 1 


R 


RO 

1 


-♦•AR1 

E 1 




LDF| |LDF 


1 


F I 


D 


1 


Rl 


•AR2-+R1 
*AR3-*'R2 

1 R2 1 


E 






1 


F 


1 
1 


D1 
F1 


1 D2 1 
1 F2 1 


R 
D 



Example 10-9 shows a parallel store followed by a single load or read. Since 
the two parallel stores are required, the next CPU -data memory read must wait 
a cycle before beginning. One program memory refetch may occur. 

Example 10-9. Parallel Store Followed by Single Read 



STF 

STF 

ADDF 

lACK 

ASH 



RO , *ARO 
R2 , *AR1 
@SUM,R1 



RO -» *AR0 in parallel with 

R2 -♦ *AR1 

Rl + @SUM -» Rl 



PIPELINE OPERATION 




STF R0,*AR1 
STF R2,*AR1 1 


F 


1 D 


1 R 


RO-**ARO 
R2-»*AR1 

1 E 1 






ADDF (aSUM,Rl 




1 F 


1 D 


(wait) 

1 R 1 


@SUM 

R 1 


E 1 


lACK 






1 F 


1 D1 1 


D2 1 


R 1 


ASH 








1 F1 1 


F2 1 


D 1 
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The final case involves an interlocked load (LDII or LDFI) instruction and XF1 
= 1 . Since the interlocked loads use the XF1 pin as an acknowledge that the 
read can complete, they may need to extend the read cycle, as shown in Ex- 
ample 10-10. Note that a program refetch may occur. 



Example 10-10. Interlocked Load 

NOT I F 

LDII 
ADD I 
CMP I 



XF1=1 
R 

D1 



XF1=0 

I R 
I D2 



I F1 I F2 I 



Hold Everything 

The three types of Hold Everything memory pipeline conflicts are: 

• A CPU-data load or store cannot be performed because an external port 
is busy. 

© An external load that takes more than one cycle. 

• Conditional calls and traps. 

The first type of Hold Everything conflict occurs when one of the external 
ports is busy due to an access that has started but is not complete. In Example 
10-1 1, the first store is a two-cycle store. The CPU writes the data to an ex- 
ternal port. The port control then takes two cycles to complete the data-data 
write. The LDF is a read over the same external port. Since the store is not 
complete, the LDF will continue to be attempted until the port is available. 
For this case, the first dummy fetch occurs at the same time as D2. 




Example 10-11. Busy External Port 



STF 
LDF 



R0,@DMA1 
(aDMA2,R0 



PIPELINE OPERATION 



STF 
LDF 



D 


1 2-cycle 

1 R 


DMA 
1 


access | 
E 1 






F 


1 D 


1 


nop 1 


R 


E 1 




1 F 


1 


D1 1 


D2 


R 1 






1 


F1 1 


F2 


D 1 
F 1 
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The second type of Hold Everything conflict involves multicycle data reads. 
The read has begun and continues until completed. In Example 10-12, the 
LDF is performed from an external memory that requires several cycles to 
complete. 

Example 10-12. Multicycle Data Reads 

I F I D I R I E 



LDF @DMA,RO 



2-cycl 
R 

D1 



read 
R 

D2 

nop 

F1 



D |. 
F2 |. 



The final type of Hold Everything conflict deals with conditional calls and 
traps, which are different from the other branch instructions. Whereas the 
other branch instructions are conditional loads, the conditional calls and traps 
are conditional stores, which take one cycle more than a standard branch (see 
Example 10-13). 



Example 10-13. Conditional Calls and Traps 

CALLcond I F | D | R 



F1 



(nop) I (nop) 



(store) 

E I 
F2 I F3 I 
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10.3 Resolving Memory Conflicts 

If program fetches and data accesses are performed in such a manner that the 
resources being used cannot provide the necessary bandwidth, the program 
fetch is delayed until the data access is complete. Certain configurations of 
program fetch and data accesses yield conditions under which the 
TMS320C30 can achieve maximum throughput. Table 10-1 shows how many 
accesses can be performed from the different memory spaces when it is nec- 
essary to do a program fetch and a single data access, and still achieve maxi- 
mum performance (one cycle). There are four cases that achieve one cycle 
maximization (see Table 10-1). Table 10-2 shows how many accesses can 
be performed from the different memory spaces when it is necessary to do a 
program fetch and two data accesses, still achieving maximum performance 
(one cycle). There are six cases that achieve this maximization (see Table 
10-2). 

Table 10-1 . One Program Fetch and One-Data Access for Maximum 

Performance 



CASE# 


PRIMARY BUS 
ACCESSES 


ACCESSES FROM 

DUAL-ACCESS 

INTERNAL MEMORY 


EXPANSION BUS 

OR PERIPHERAL 

ACCESSES 










1 


1 


1 


- 


2 


1 


- 


1 


3 




2 from any 

combination 

of internal memory 




4 


- 


1 


1 
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Table 10-2. One Program Fetch and Two Data Accesses for 
Maximum Performance 



CASE# 


PRIMARY BUS 
ACCESSES 


ACCESSES FROM 

DUAL-ACCESS 

INTERNAL MEMORY 


EXPANSION OR 

PERIPHERAL BUS 

ACCESSES 










1 


1 


2 from any 

combination 

of internal memory 


— 


2 


1 Program 


1 Data 


1 Data 


3 


1 Data 


1 Data 


1 Program 


4 




2 from same internal 

memory block and 

1 from a different 

internal memory 

block 




5 




3 from different 

internal memory 

blocks 




6 




2 from any 

combination 

of internal memory 


1 
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10.4 Clocking Of Memory Accesses 

Internal clock phases (HI and H3) and their relationship to memory accesses 
are discussed in this section to show how the TMS320C30 handles multiple 
memory accesses. Whereas the previous section discussed the interaction 
between sequences of instructions, this section discusses the flow of data on 
an individual instruction basis. 

Each major clock period of 60 ns is composed of two minor clock periods of 
30 ns, labeled as H3 and HI . 



V- 



Major Clock Period 




The precise operation of memory reads and writes can be defined, based upon 
these minor clock periods. The types of memory operations which can occur 
are program fetches, data loads and stores, and DMA accesses. 



10.4.1 Program Fetches 




Internal program fetches are always performed during H3 unless a single data 
store must occur at the same time, due to another instruction in the pipeline. 
In this case, the program fetch occurs during HI and the data store during 
H3. 

External program fetches always start at the beginning of H3 with the address 
being presented on the external bus. At the end of HI, they are completed 
with the latching of the instruction word. 



10.4.2 Data Loads and Stores 



Four types of instructions perform loads, memory reads, and stores: two-ope- 
rand instructions, three-operand instructions, multiplier/ALU operation with 
store instructions, and parallel multiply and add instructions. See Section 6 for 
detailed information on addressing modes. 

Two-Operand Instruction Memory Accesses 

Two-operand instructions include all those instructions with bits 31 -29 being 
000 or 010 (see Figure 10-2). In the case of a data read, bits 15-0 represent 
the src operand. Internal data reads are always performed during HI . External 
data reads always start at the beginning of H3 with the address being pre- 
sented on the external bus, and complete with the latching of the instruction 
word at the end of HI. 
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In the case of a data store, bits 1 5-0 represent the dst operand. Internal data 
stores are performed during H3. External data stores always start at the be- 
ginning of H3 with the address and data being presented on the external bus. 



31 


24 23 






1615 


87 





0X0 


Operation 


G 


1 1 1 1 
dst(src) 


src(cist) 



Figure 10-2. Two-Operand Instruction Word 



Three-Operand Instruction Memory Reads 

Three-operand instructions include all instructions with bits 31 -29 being 001 
(see Figure 10-3). The source operands, srcl and src2, come from either reg- 
isters or memory. When one or more of the source operands are from memory, 
these instructions are always memory reads. 

If only one of the source operands is from memory (either srcl or src2) and 
is located in internal memory, the data is read during HI . If the single memory 
source operand is in external memory, the read starts at the beginning of H3, 
with the address being presented on the external bus, and completes with the 
latching of the data word at the end of HI . 

If both source operands are to be fetched from memory, then several cases 
occur. If both operands are located in internal memory, the srcl is performed 
during H3 and src2 during HI, thus completing two memory reads in a single 
cycle. 

If srcl is in internal memory and src2 in external memory, the src2 access is 
begun at the start of H3 and latched at the end of HI . At the same time, the 
srcl access to internal memory is performed during H3. Again, two memory 
reads are completed in a single cycle. 

If srcl is in external memory and src2 in internal memory, two cycles are nec- 
essary to complete the two reads. In the first cycle, the internal src2 access 
is performed. The srcl is also performed, but not latched until the next H3. 

If srcl and src2 are both from external memory, two cycles are required to 
complete the two reads. In the first cycle, the srcl access is performed and 
loaded on the next H3; in the second cycle, the src2 access is performed and 
loaded on that cycle's HI . 
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Figure 10-3. Three-Operand Instruction Word 
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Operations with Parallel Stores 

The next class of instructions includes all instructions that have a store, in 
parallel with another instruction. Bits 31 and 30 for these instruction are equal 
tol 1. 

For those operations that perform a multiply or ALU operation in parallel with 
a store, the instruction word format is shown in Figure 10-4. If the store op- 
eration to dst2 is external or internal, it is performed during H3. 

If the memory read operation is external, it is started at the beginning of H3 
and latched at the end of HI. If the memory read operation is internal, it is 
performed during HI. Note that memory reads are performed by the CPU 
during the read (R) phase of the pipeline, and stores during the execute (E) 
phase. 



31 



24 23 
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T — I — I — r 
Operation 



-1 — r 
dsti 



-1 — r 
srcl 



-1 — r 
src3 



-T — I — 1 — I — I — I — r 
dst2 



T — I — I — I — I — r 
src2 



1 1 



Figure 10-4. A Multiply or CPU Operation with a Parallel Store 




The instruction word format for those instructions that have parallel stores to 
memory is shown in Figure 10-5. If both destination operands, dsti and 
dst2, are located in internal memory, dsti is stored during H3 and dst2 during 
HI, thus completing two memory stores in a single cycle. 

If dsti is in external memory and dst2 in internal memory, the dsti store is 
begun at the start of H3. The dst2 store to internal memory is performed 
during HI . Again, two memory stores are completed in a single cycle. 

If dsti is in internal memory and dst2 in external memory, two cycles are ne- 
cessary to complete the dst2 store. In the first cycle, the internal dsti store is 
performed during H3. During the next cycle, the dst2 store is performed be- 
ginning in H3. 

If dsti and dst2 are both from external memory, two cycles are necessary to 
complete the dst2 store. In the first cycle, the dsti access is performed; in the 
second cycle, the dst2 access is performed. 
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Figure 10-5. Two Parallel Stores 
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Parallel Multiplies and Adds 

The considerations of memory addressing for parallel multiplies and adds is 
similar to that for three-operand instructions. The parallel multiplies and adds 
include all instructions with bits 31 -30 equal to 10 (see Figure 10-6). 

For these operations, src3 and src4 are both located in memory. If both op- 
erands are located in internal memory, src3 is performed during H3 and src4 
during HI, thus completing two memory reads in a single cycle. 

If src3 is in internal memory and src4 in external memory, the src4 access is 
begun at the start of H3 and latched at the end of HI . At the same time, the 
srcS access to internal memory is performed during H3. Again, two memory 
reads are completed in a single cycle. 

If srcS is in external memory and src4 in internal memory, two cycles are nec- 
essary to complete the two reads. In the first cycle, the internal src4 access 
is performed. During the H3 of the next cycle, the srcS access is performed. 

If srcS and src4 are both from external memory, two cycles are necessary to 
complete the two reads. In the first cycle, the srcS access is performed; in the 
second cycle, the src4 access is performed. 
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Section 11 

Assembly Language Instructions 



The TMS320C30 assembly language instruction set supports numeric- 
intensive signal processing and general-purpose applications. The instructions 
are organized into major groups consisting of load and store, two- or three- 
operand arithmetic/logical, parallel, program control, and interlocked oper- 
ations instructions. The addressing modes used with the instructions are 
described in Section 6. 

An additional feature of the TMS320C30 instruction set is the capability of 
using one of 19 condition codes with any of the 10 conditional instructions, 
such as LDFcond. This section defines the condition codes and flags. 

The assembler allows optional syntax forms to simplify the assembly language 
for special-case instructions. These optional forms are listed and explained. 

Each of the individual instructions is described and listed in alphabetical order. 
An illustration showing an example instruction (see pages 11-15 through 
11-17) is provided to show the special format used and explain its content. 

Major topics discussed in this section are as follows: 

• Instruction Set (Section 1 1 .1 on page 1 1 -2) 

- Load and store instructions 

- Two-operand arithmetic/logical instructions 

- Three-operand arithmetic/logical instructions 

- Program control instructions 

- Interlocked operations instructions 

- Parallel operations instructions 

O Condition Codes and Flags (Section 1 1 .2 on page 1 1 -8) 

• Individual Instructions (Section 1 1 .3 on page 11-11) 

- Symbols and abbreviations used in instructions 

- Optional assembler syntaxes 

- Individual instruction descriptions alphabetized and including: 

Syntax 

Operation 

Operands 

Encoding 

Description 

Cycles 

Status bits 

Mode bit 

Example(s) 
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11.1 Instruction Set 



The TMS320C30 instruction set is exceptionally well suited to digital signal 
processing and other numeric-intensive applications. All instructions are a 
single machine word long, and most instructions take a single cycle to exe- 
cute. In addition to multiply and accumulate instructions, the TMS320C30 
possesses a full complement of general-purpose instructions. 

The instruction set contains 114 instructions organized into the following 
functional groups: 

Load and store 

Two-operand arithmetic/logical 
Three-operand arithmetic/logical 
Program control 
Interlocked operations 
Parallel operations. 

Each of these groups is discussed in the succeeding subsections. 

11.1.1 Load and Store Instructions 

The TMS320C30 supports 12 load and store instructions (see Table 11-1). 
These instructions can: 



• Load a word from memory into a register, 

• Store a word from a register into memory, or 

• Manipulate data on the system stack. 

Two of these instructions can load data conditionally. This is useful for locat- 
ing the maximum or minimum value in a data set. See Section 1 1 .2 for detailed 
information on condition codes. 

Table 11-1. Load and Store Instructions 




INSTRUCTION 


DESCRIPTION 


INSTRUCTION 


DESCRIPTION 


LDE 


Load floating-point exponent 


POP 


Pop integer from stack 


LDF 


Load floating-point value 


POPF 


Pop floating-point value from 
stack 


LDfcond 


Load floating-point value 
conditionally 


PUSH 


Push integer on stack 


LDI 


Load integer 


PUSHF 


Push floating-point value on 
stack 


LD\cond 


Load integer conditionally 


STF 


Store floating-point value 


LDM 


Load floating-point mantissa 


STI 


Store integer 
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11.1.2 Two-Operand Instructions 

The TMS320C30 supports a complete set of two-operand arithmetic and 
logical instructions. The two operands are the source and destination. The 
source operand may be a memory word, a register, or a part of the instruction 
word. The destination operand is always a register. 

These instructions provide integer, floating-point, or logical operations, and 
multiprecision arithmetic. Table 11-2 lists these instructions. 

Table 11-2. Two-Operand Instructions 



INSTRUCTION 


DESCRIPTION 


INSTRUCTION 


DESCRIPTION 


ABSF 


Absolute value of a floating- 
point number 


NORM 


Normalize floating-point value 


ABSI 


Absolute value of an integer 


NOT 


Bitwise logical-complement 


ADDC 




Add integers with carry 


OR t 


Bitwise logical-OR 


ADDF 




Add floating-point values 


RND 


Round floating-point value 


ADD! 




Add integers 


ROL 


Rotate left 


AND 




Bitwise logical- AND 


ROLC 


Rotate left through carry 


ANDN 




Bitwise logical-AND with 
complement 


ROR 


Rotate right 


ASH 




Arithmetic shift 


RORC 


Rotate right through carry 


CMPF 




Compare floating-point values 


SUBB t 


Subtract integers with borrow 


CMP! 




Compare integers 


SUBC 


Subtract integers conditionally 


FIX 


Convert floating-point value to 
integer 


SUBF 


Subtract floating-point values 


FLOAT 


Convert Integer to floating-point 
value 


SUBI 


Subtract integer 


LSH 


t 


Logical shift 


SUBRB 


Subtract reverse integer with 
borrow 


MPYF 


t 


Multiply floating-point values 


SUBRF 


Subtract reverse floating-point 
value 


MPYI 


t 


Multiply integers 


SUBRI 


Subtract reverse integer 


NEGB 


Negate integer with borrow 


TSTB t 


Test bit fields 


NEGF 


Negate floating-point value 


XOR t 


Bitwise exclusive-OR 


NEGI 


Negate integer 








t Two- and three-operand versions 
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11.1.3 Three-Operand Instructions 

Most instructions have only two operands; however, some arithmetic and 
logical instructions have three-operand versions. Three-operand instructions 
allow the TMS320C30 to read two operands from memory or the CPU register 
file in a single cycle and store the results in a register. The following differ- 
entiates the two- and three-operand instructions: 

• Two-operand instructions have a single source operand (or shift count) 
and a destination operand. 

• Three-operand instructions may have two source operands (or one 
source operand and a count operand) and a destination operand. A 
source operand may be a memory word or a register. The destination 
of a three-operand instruction is always a register. 

Table 11-3 lists the instructions that have three-operand versions. Note that 
the '3' in the mnemonic can be omitted from three-operand instructions (see 
Section 11.3.2). 

Table 11-3. Three-Operand Instructions 



INSTRUCTION 


DESCRIPTION 


INSTRUCTION 


DESCRIPTION 


ADDC3 


Add with carry 


MPYF3 


Multiply floating-point values 


ADDF3 


Add floating-point values 


MPYI3 


Multiply integers 


ADDIS 


Add integers 


0R3 


Bitwise logical-OR 


AND3 


Bitwise logical-AND 


SUBB3 


Subtract integers with borrow 


ANDN3 


Bitwise logical-AND with 
complement 


SUBF3 


Subtract floating-point values 


ASH3 


Arithmetic shift 


SUBI3 


Subtract integers 


CMPF3 


Compare floating-point values 


TSTB3 


Test bit fields 


CMPI3 


Compare integers 


X0R3 


Bitwise exclusive-OR 


LSH3 


Logical shift 








11.1.4 Program Control Instructions 

The program-control instruction group consists of all of those instructions 
which affect program flow. The repeat mode allows repetition of a block of 
code (RPTB) or of a single line of code (RPTS). Both standard and delayed 
(single-cycle) branching are supported. Several of the program control in- 
structions are capable of conditional operations (see Section 1 1 .2 for detailed 
information on condition codes). Table 11-4 lists the program control in- 
structions. 



11-4 



Assembly Language Instructions - Instruction Set 



Table 11-4. Program Control Instructions 



INSTRUCTION 


DESCRIPTION 


INSTRUCTION 


DESCRIPTION 


Qcond 


Branch conditionally (standard) 


IDLE 


Idle until interrupt 


BcondD 


Branch conditionally (delayed) 


NOP 


No operation 


BR 


Branch unconditionally 
(standard) 


RETI cond 


Return from interrupt 
conditionally 


BRD 


Branch unconditionally 
(delayed) 


RETS cond 


Return from subroutine 
conditionally 


CALL 


Call subroutine 


RPTB 


Repeat block of instructions 


CALLcond 


Call subroutine conditionally 


RPTS 


Repeat single instruction 


DBcond 


Decrement and branch 
conditionally (standard) 


SWI 


Software interrupt 


DBcondD 


Decrement and branch 
conditionally (delayed) 


TRAP cond 


Trap conditionally 



11.1.5 Interlocked Operations Instructions 

The interlocked operations instructions support multiprocessor communi- 
cation. Through the use of external signals, these instructions allow for pow- 
erful synchronization mechanisms. They also guarantee the integrity of the 
communication and result in a high-speed operation. Refer to Section 7 for 
examples of the use of interlocked instructions. Table 1 1 -5 lists the five in- 
terlocked operations instructions. 

Table 11-5. Interlocked Operations Instructions 



INSTRUCTION 


DESCRIPTION 


INSTRUCTION 


DESCRIPTION 


LDFI 


Load floating-point value, 
interlocked 


STFI 


Store floating-point value, 
interlocked 


LDII 


Load integer, interlocked 


STII 


Store integer, interlocked 


SIGI 


Signal, interlocked 







11.1.6 Parallel Operations Instructions 

The parallel-operations instructions group allows for a high degree of paral- 
lelism. Some of the TMS320C30 instructions can occur in pairs that will be 
executed in parallel. These parallel instructions provide: 

• Parallel loading of registers, 

• Parallel arithmetic operations, or 

• Arithmetic/logical instructions used in parallel with a store instruction. 

Each instruction in a pair is entered as a separate source statement. The sec- 
ond instruction in the pair must be preceded by two vertical bars (||). Table 
11-6 lists the valid instruction pairs. 
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Table 11 -6. Parallellnstructions 




MNEMONIC 


DESCRIPTION 


OPERATION 


PARALLEL ARITHMETIC WITH STORE INSTRUCTIONS 


ABSF 

11 STF 


Absolute value of a floating-point 


|src2| -♦ dsti 
11 src3 -♦ dst2 


ABSI 
11 STI 


Absolute value of an integer 


|src2| -» dsti 
1 1 src3 -* dst2 


ADDF3 
II STF 


Add floating-point 


srcl + src2 -♦ dsti 
1 1 src3 -♦ dst2 


ADDIS 

11 STI 


Add integer 


srcl + src2 -♦ dst1 
1 1 src3 -* dst2 


AND3 
II STI 


Bitwise logical-AND 


src1 AND src2 -* dsti 
1 1 src3 -♦ dst2 


ASH3 

11 ST! 


Arithmetic shift 


If count >.0: 

src2 << count -♦ dsti 
II src3 -♦ dst2 
Else: 

src2 >> IcountI -* dsti 
II src3 -♦ dst2 


FIX 

II STI 


Convert floating-point to integer 


Fix(src2) -* dsti 
1 1 src3 -♦ dst2 


FLOAT 
II STF 


Convert integer to floating-point 


Float(src2) -> dsti 
1 1 src3 -♦ dst2 


LDF 
II STF 


Load floating-point 


src2 -♦ dsti 
II src3 -» dst2 


LDI 
II STI 


Load integer 


src2 -♦ dsti 
1 1 src3 -» dst2 


LSH3 
11 ST! 


Logical shift 


If count > 0: 

src2 << count -> dsti 
II src3 -► dst2 
Else: 

src2 >> IcountI -♦ dsti 
1 1 src3 -♦ dst2 


MPYF3 
II STF 


Multiply floating-point 


srcl X src2 -♦ dsti 
1 1 src3 -* dst2 


MPYI3 
II STI 


Multiply integer 


srcl X src2 -* dsti 
1 1 src3 -* dst2 


NEGF 
11 STF 


Negate floating-point 


0- src2 -> dsti 
1 1 src3 -♦ dst2 



LEGEND: 

srcl - register addr (R0-R7) 
srcS - register addr (R0-R7) 
dsti - register addr (R0-R7) 



src2 - indirect addr (disp = 0, 1 , I RO, I R1 ) 
src4 - indirect addr (disp = 0, 1 , I RO, I R1 ) 
dst2 - indirect addr (disp = 0, 1 , I RO, I R1 ) 
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Table 11-6. Parallellnstructions (Concluded) 



MNEMONIC 


DESCRIPTION 


OPERATION 


PARALLEL ARITHMETIC WITH STORE INSTRUCTIONS (Concluded) 


NEGI 
II STI 


Negate integer 


- src2 -* dsti 
II src3 -* dst2 


N0T3 

II STI 


Complement 


src-* dsti 
1 1 src3 -» dst2 


0R3 

II STI 


Bitwise logical-OR 


srcl OR src2 -♦ dsti 
II src3 -♦ dst2 


STF 

II STF 


Store floating-point 


srcl -♦ dsti 
1 1 src3 -» dst2 


STI 

II ST! 


Store integer 


src1 -* dsti 
II src3 -♦ dst2 


SUBF3 

II STF 


Subtract floating-point 


scrl - src2 -*■ dst1 
1 1 src3 -* dst2 


SUBI3 

II STI 


Subtract integer 


srcl - src2 -♦ dst1 
II src3 -* dst2 


X0R3 

II STI 


Bitwise exclusive-OR 


srcl XOR src2 -♦ dsti 
1 1 src3 "* dst2 


PARALLEL LOAD INSTRUCTIONS 


LDF 
II LDF 


Load floating-point 


src2 -^ dsti 
II src4 -» dst2 


LDI 

II LDI 


Load integer 


src2 -♦ dsti 
1 1 src4 ->■ dst2 


PARALLEL MULTIPLY AND ADD/SUBTRACT INSTRUCTIONS 


MPYF3 
II ADDF3 


Multiply and add floating-point 


opi X op2 -► op3 
II op4 + op5 -♦ op6 


MPYF3 
II SUBF3 


Multiply and subtract floating-point 


opi X op2 -♦ op3 
II op4 - op5 -♦ op6 


MPYI3 
II ADDi3 


Multiply and add integer 


opi X op2 -♦ op3 
II op4 + op5 -* op6 


MPYI3 
II SUBI3 


Multiply and subtract integer 


opi X op2 -*■ op3 
II op4 - op5 -» op6 



LEGEND: 

srcl - register addr (R0-R7) 
src3 - register addr (R0-R7) 
dsti - register addr (R0-R7) 
op3 - register addr (RO or R1) 



src2- indirect addr (disp = 0, 1, IRO, IR1) 
src4- indirect addr (disp = 0, 1, IRO, IR1) 
dst2- indirect addr (disp = 0, 1, IRO, IR1) 
op6 - register addr (R2 or R3) 




op1,op2,op4,op5 - Two of these operands must be specified using register addr, 
and two must be specified using indirect addr. 



11-7 



Assembly Language Instructions - Condition Codes and Flags 



11.2 Condition Codes and Flags 

The TMS320C30 provides 20 condition codes that can be used with any of 
the conditional instructions, such as RETScond or LDFcond. The conditions 
include signed and unsigned comparisons, comparisons to zero, and compar- 
isons based on the status of individual condition flags. Note that all condi- 
tional instructions can accept the suffix 'U' to indicate unconditional 
operation. 

Seven condition flags provide information related to properties of the result 
of arithmetic and logical instructions. The condition flags are stored in the 
status register (ST). These flags are modified by the majority of instructions 
according to whether a result is generated when performing the specified op- 
eration to infinite precision or an output is written to the destination register. 
The formats for output values are shown in Table 1 1 -7. 

Table 11-7. Output Value Formats 



TYPE OF OPERATION 


OUTPUT FORMAT 


Floating-point 

integer 

Logical 


8-bit exponent, 1 sign bit, 31 -bit fraction 

32-bit integer 

32-bit unsigned integer 



The condition flags are affected by instructions in only the following cases: 

1 ) The destination register is one of the extended-precision registers (RO - 
R7) 

2) The instruction is one of the compare instructions (CMPF, CMPF3, 
CMPI, CMPI3, TSTB, or TSTB3). 

Case 1 allows for modification of the registers used for addressing without 
affecting the condition flags during computation. Case 2 makes it possible to 
set the condition flags based upon the contents of any of the CPU registers. 

The following list defines the condition flags and describes how the flags are 
set by most instructions. For specific details of the effect of a particular in- 
struction on the condition flags, see the description of that instruction in 
Section 9.2. 

N Negative Condition Flag. Logical operations assign N the state 

of the MSB of the output value. For integer and floating-point oper- 
ations, N is set if the result is negative, and cleared otherwise. Zero 
is considered to be positive. 

Z Zero Condition Flag. For logical, integer, and floating-point oper- 

ations, Z is set if the output is 0, and cleared otherwise. 

V Overflow Condition Flag. For integer operations, V is set if the 

result does not fit into the format specified for the destination (i.e., 
.232 < result < 2^^ - 1 ). Otherwise, V is cleared. For floating-point 
operations, V is set if the exponent of the result is greater than 127, 
otherwise,V is cleared. Logical operations always clear V. 

C Carry Flag. When an integer addition is performed, C is set if a carry 

occurs out of the bit corresponding to the MSB of the output. When 
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an integer subtraction is performed, C is set if a borrow occurs into the 
bit corresponding to the MSB of the output. Otherwise, for integer 
operations, C is cleared. The carry flag is unaffected by floating-point 
and logical operations. 

UF Floating-Point Underflow Condition Flag. A floating-point 
underflow occurs whenever the exponent of the result is less than or 
equal to -1 28. If a floating-point underflow occurs, UF is set, and the 
output value is set to 0. UF is cleared if a floating-point underflow 
does not occur. 

LV Latched Overflow Condition Flag, LV is set whenever V (over- 

flow condition flag) is set. Otherwise, it is unchanged. LV may only 
be cleared by a processor reset or by modifying it in the status register 
(ST). 

LUF Latched Underflow Condition Flag. LUF is set whenever UF 
(floating-point underflow flag) is set. LUF may only be cleared by a 
processor reset or by modifying it in the status register (ST). 

Table 11-8 lists the condition mnemonic, code, description, and flag for each 
of the 19 conditions. 
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Table 11-8. Condition Codes and Flags 




CONDITION 


CODE 


DESCRIPTION 


FLAG 


UNCONDITIONAL COMPARES 


U 


00000 


Unconditional 


Don't care 


UNSIGNED COMPARES 


LO 


00001 


Lower than 


C 


LS 


00010 


Lower or same 


CORZ 


HI 


00011 


Higher than 


-CAND ~Z 


HS 


00100 


Higher or same 


~C 


EQ 


00101 


Equal 


Z 


NE 


00110 


Not Equal 


~Z 


SIGNED COMPARES 


LT 


00111 


Less than 


N 


LE 


01000 


Less than or equal 


NORZ 


GT 


01001 


Greater than 


~N AND ~Z 


GE 


01010 


Greater than or equal 


~N 


EQ 


00101 


Equal 


Z 


NE 


00110 


Not equal 


~z 


COMPARE TO ZERO 


Z 


00101 


Zero 


z 


NZ 


00110 


Not zero 


~z 


P 


01001 


Positive 


~N AND ~Z 


N 


00111 


Negative 


N 


NN 


01010 


Nonnegative 


~N 


COMPARE TO CONDITION FLAGS 


NN 


01010 


Nonnegative 


~N 


N 


00111 


Negative 


N 


NZ 


00110 


Nonzero 


~Z 


Z 


00101 


Zero 


z 


NV 


01100 


No overflow 


~v 


V 


01101 


Overflow 


V 


NUF 


01110 


No underflow 


~UF 


UF 


01111 


Underflow 


UF 


NC 


00100 


No carry 


~C 


C 


00001 


Carry 


C 


NLV 


10000 


No latched overflow 


~LV 


LV 


10001 


Latched overflow 


LV 


NLUF 


10010 


No latched floating-point underflow 


~LUF 


LUF 


10011 


Latched floating-point underflow 


LUF 


ZUF 


10100 


Zero or floating-point underflow 


ZOR UF 



~ Logical complement 
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11.3 Individual Instructions 

This section contains the individual assembly language instructions for the 
TMS320C30. The instructions are listed in alphabetical order. Information, 
such as assembler syntax, operation, operands, encoding, description, cycles, 
status bits, mode bit, and examples, is provided for each instruction. An ex- 
ample instruction precedes the individual instruction listings to show the 
special format used and explain its content. 

Preceding the individual instruction descriptions, the symbols and abbrevi- 
ations used in the individual instructions are defined. In addition, some op- 
tional syntax forms allowed by the assembler are described. 

A functional grouping of the instructions is provided in Section 1.6. A com- 
plete instruction set summary can be found in Section 1 .6.8. Appendix B lists 
the opcodes for all the instructions. Refer to Section 6 for information on 
memory addressing. Code examples using many of the instructions are given 
in Section 12, Software Applications. 

11.3.1 Symbols and Abbreviations 

Table 11-9 lists the symbols and abbreviations used in the individual instruc- 
tion descriptions. 
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Assembly Language instructions - individual Instructions 



Table 11-9. Instruction Symbols 




SYMBOL 


MEANING 


src 

srcl 

src2 

src3 

src4 


Source operand 
Source operand 1 
Source operand 2 
Source operand 3 
Source operand 4 


dst 

dsti 

dst2 

disp 

cond 

count 


Destination operand 
Destination operand 1 
Destination operand 2 
Displacement 
Condition 
Shift count 


G 

T 
P 
B 


General addressing modes 
Three-operand addressing modes 
Parallel addressing modes 
Conditional-branch addressing modes 


ARn 

IRn 

Rn 

RC 

RE 

RS 

ST 


Auxiliary register n 
Index register n 
Register address n 
Repeat count register 
Repeat end address register 
Repeat start address register 
Status register 


C 

GIE 

N 

PC 

RiVI 

SP 


Carry bit 

Global interrupt enable bit 

Trap vector 

Program counter 

Repeat mode flag 

System stack pointer 


|x| 

X -* y 
x(man) 
x{exp) 


Absolute value of x 
Assign the value of x to destination y 
Mantissa field (sign + fraction) of x 
Exponent field of x 


opi 

llop2 


Operation 1 performed in parallel with operation 2 


xANDy 
xOR y 
xXORy 
~x 


Bitwise logical-AND of x and y 
Bitwise logical-OR of x and y 
Bitwise logical-XOR of x and y 
Bitwise logical-complement of x 


X << y 
X >> y 
* ++SP 
*SP-- 


Shift X to the left y bits 

Shift X to the right y bits 

Increment SP and use incremented SP as address 

Use SP as address and decrement SP 



11-12 



Assembly Language Instructions - Individual Instructions 

11.3.2 Optional Assembler Syntaxes 

The assembler allows a relaxed syntax form for some of the instructions. 
These optional forms simplify the assembly language so that special-case 
syntax can be ignored for some of the instructions. The following is a list of 
these optional syntax forms. 

O The destination register can be omitted on unary arithmetic and logical 
operations when the same register is used as a source. For example, 

ABSI RO,RO can be written as absi ro 

Instructions affected: ABSI, ABSF, FIX, FLOAT, NEGB, NEGF, NEGI, 
NORM, NOT, RND. 

O All 3-operand instructions can be written without the '3'. For example, 

ADDI3 R0,Rl,R2 can be written as addi ro,ri,r2 

Instructions affected: ADDC3, ADDF3, ADDI3, AND3, ANDN3, ASH3, 
LSH3, MPYF3, MPYI3, 0R3, SUBB3, SUBF3, SUBI3, X0R3. 

This also applies to all the pertinent parallel instructions. 

® All 3-operand comparison instructions can be written without the '3'. 
For example, 

CMPI3 ro,*aro can be written as cmpi ro,*aro 

Instructions affected: CMPI3, CMPF3, TSTB3. 

© Indirect operands with an explicit displacement are allowed. In 
3-operand or parallel instructions, operands with displacement are 
automatically converted to "no-displacement" mode. For example: 

LDI *+AR0(0),R1 

is legal 
Also 

ADDI3 *+ARO(0) ,R1,R2 is equivalent to ADDIS *AR0,R1,R2 

© Indirect operands can be written with no displacement, in which case a 
displacement of one is assumed. For example, 

LDI *AR0++(1),R0 can be written LDI *ARO++,RO 

All conditional instructions accept the suffix 'U' to indicate uncondi- 
tional operation. Also, the U can be omitted from unconditional short 
branch instructions. For example: 

BU label can be written B label 

© Labels can be written with or without a trailing colon. For example: 

labelO: NOP 
labell NOP 
label2: 

NOP 
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Empty expressions are not allowed for the displacement in indirect 
mode: 

LDI *+ARO(),RO is not legal 

Long immediate mode operands (destination of BR and CALL) can be 
written with an at-sign: 

BR label can be written BR @label 

The LDP pseudo-op can be used to load a register (usually DP) with the 
8 MSBs of a relocatable address. The instruction is written: 

LDP addr,REG or LDP (aaddr,REG 

The at-sign is optional. 

If the destination REG is the DP, it can be omitted. LDP generates a LDI 
instruction with an immediate operand, and a special relocation type. 

Parallel instructions can be written in either order. For example: 

ADDI can be written as STi 

I I STI I I ADDI 

The parallel bars indicating part 2 of a parallel instruction can be written 
anywhere on the line, from column to the mnemonic. For example: 

ADDI can be written as addi 

I I STI I I STI 

If the second operand of a parallel instruction is the same as the third 
(destination register) operand, the third operand can be omitted. This 
allows the writing of 3-operand parallel instructions that 'look like' nor- 
mal 2-operand instruction. For example, 

ADDI *AR0,R2,R2 Can be written as ADDI *AR0,R2 

1 I MPYI *AR1,R0,R0 | j MPYI *AR1,R0 

Instructions (applies to all parallel instructions that have a register sec- 
ond operand) affected: ADDI, ADDF, AND, MPYI, MPYF, OR, SUBI, 
SUBF, XOR. 

All commutive operations in parallel instructions can be written in either 
order. For example, the ADDI part of a parallel instruction can be written 
in either of two ways: 

ADDI *AR0,R1,R2 or ADDI R1,*AR0,R2 

The instructions affected are parallel instructions containing any of the 
following: ADDI, ADDF, MPYI, MPYF, AND, OR, XOR. 
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11.3.3 Individual Instruction Descriptions 

Each assembly language instruction for the TMS320C30 is described in this 
section. The instructions are listed in alphabetical order. An example instruc- 
tion precedes the individual instructions to show the special format used and 
explain its content. This example instruction describes the assembler syntax, 
operation, operands, encoding, description, cycles, status bits, mode bit, and 
examples. 
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EXAMPLE 



Example Instruction 



Syntax 



Operation 



Operands 




INST <src>,<dst> 

or 

INST1 <src2>,<dst1> 
II INST2 <src3>,<dst2> 

Each instruction begins witii an assembler syntax expression. Labels may 
be placed either preceding the command (instruction mnemonic) on the 
same line or on the preceding line in the first column. The optional com- 
ment field that concludes the syntax is not included in the syntax ex- 
pression. Space(s) are required between each field (label, command, 
operand, and comment fields). 

The syntax examples illustrate the common one-line syntax and the two- 
line syntax used in parallel addressing. Note that the two vertical bars || that 
indicate a parallel addressing pair can be placed anywhere before the mne- 
monic on the second line. The first instruction in the pair can have a label, 
but the second instruction cannot have a label. 



\src\ 
or 



dst 



\src2\ -^ dsti 
1 1 src3 ->■ dstZ 

The instruction operation sequence describes the processing that takes 
place when the instruction is executed. For parallel instructions, the opera- 
tion sequence is performed in parallel. Conditional effects of status register 
specified modes will be listed for conditional instructions such as Bcond. 

src general addressing modes (G): 
register (Rn, ^ n ^ 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n < 27) 



or 

src2 
dst 7 
src3 
dst2 



indirect (disp = 0, 1, IRO, IR1) 
register (Rnl, ^ n1 ^7) 
register (Rn2, ^ n2 £ 7) 
indirect (disp = 0, 1, IRO, IR1) 



Operands are defined according to the addressing mode and/or the type of 
addressing used. Note that indirect addressing uses displacements and the 
index registers. Refer to Section 6 for detailed information on addressing. 
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Example instruction 



EXAMPLE 



Encoding 



31 






24 23 








1615 




87 










1 r 1 1 1 

INST 


G 


1 1 1 1 
dst 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
src 


or 
31 2423 1615 




87 







— 1 — 
1 1 


INS1IIINS2 


1 1 
dsti 


1 1 



1 ( 
src3 


1 1 1 1 1 1 1 
dst2 


1 1 1 1 1 1 1 
src2 



Description 



Cycles 



Status Bits 



Encoding examples are shown using general addressing and parallel ad- 
dressing. The instruction pair for the parallel addressing example consists 
of INS1 and INS2. 

Instruction execution and its effect on the rest of the processor or memory 
contents are described. Any constraints on the operands imposed by the 
processor or the assembler are discussed. The description parallels and 
supplements the information given by the operation block. 

1 

The digit specifies the number of cycles required to execute the instruction. 

N Negative Condition Flag. 1 if a negative result is generated, 
otherwise. In some instructions, this flag is the MSB of the output. 
For other instructions, this flag is unaffected. 

Z Zero Condition Flag. 1 if a zero result is generated, otherwise. 

For logical and shift instructions, 1 if a zero output is generated, 
otherwise. This flag may be unaffected. 

V Overflow Condition Flag. 1 if an integer or floating-point over- 
flow occurs, otherwise. This flag may be unaffected. 

C Carry Flag. 1 if a carry or borrow occurs, otherwise. For shift 
instructions, this flag is set to the value of the last bit shifted out; 
for a shift count of 0. This flag may be unaffected. 

UF Floating-Point Underflow Condition Flag. If a floating-point 
underflow occurs, otherwise. This flag may be unaffected. 

LV Latched Overflow Condition Flag. 1 if an integer or floating- 
point overflow occurs, unchanged otherwise. This flag may be un- 
affected. 

LUF Latched Floating-Point Underflow Condition Flag. 1 if a 

floating-point underflow occurs, unchanged otherwise. This flag 
may be unaffected. 

The seven condition flags, stored in the status register (ST), are modified 
by the majority of instructions. They provide information as to the properties 
of the result or output of arithmetic or logical operations. 




/\/lode Bit 



OVM Overflow Mode Flag. 

by the OVM flag. 



In general, integer operations are affected 
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EXAMPLE 



Example Instruction 



Example INST @98AEh,R5 

Before Instruction: 

DP = 80h 

R5 = 0766900000h = 2.30562500e+02 

Memory at 8098AEh = 5CDFh = 1. 00001 107e+00 

LUFLVUFNZVC = 0000000 

After Instruction: 

DP = 80h 

R5 = 0066900000h = 1.80126953e+00 

Memory at 8098AEh = 5CDFh = 1. 00001 107e+00 

LUFLVUFNZVC = 0000000 

The sample code presented in the above format shows the effect of the 
code on system pointers (e.g., DP or SP), registers (e.g., R1 or R5), mem- 
ory at specific locations, and the seven status bits. The values given for the 
registers include the leading zeros to show the exponent in floating-point 
operations. Decimal conversions are provided for all register and memory 
locations. The seven status bits are listed in the order in which they appear 
in the assembler and simulator (see Table 11-9 and Section 1 1 .2 for further 
information on these seven status bits). 
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Absolute Value of Floating-Point 



ABSF 



Syntax 

Operation 

Operands 



ABSF <src>,<dst> 

\src\ -* dst 

src general addressing modes (G): 
register ( Rn, £ n ^ 7) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n ^ 7) 



Encoding 
31 



24 23 



1615 



87 



— I — I — I — I — I — I — I — I — 
000000000 



dst 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



The absolute value of the src operand is loaded into the dst register. The 
src and dst operands are assumed to be floating-point numbers. 

An overflow occurs if src (man) = SOOOOOOOh and src (exp) = 7Fh. The 
result is dst (man) = 7FFFFFFFh and dst (exp) = 7Fh. 

1 

N 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

UF 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF Unaffected. 



OVM Operation not affected by OVM. 
ABSF R4,R7 
Before Instruction: 



R4 = 05C8000F971h = -9.90337307e+27 
R7 = 07D251100AEh = 5.48527255e+37 
LUFLVUFNZVC = 0000000 

After Instruction: 

R4 = 05C8000F971 h = -9.90337307e+27 
R7 = 05C7FFF068Fh = 9.90337307e+27 
LUFLVUFNZVC = 0000000 
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ABSFIISTF 



Parallel ABSF and STF 



Syntax 

Operation 

Operands 

Encoding 



ABSF <src2>,<dst1> 
II STF <src3> ,<dst2> 

\src2\ -* dsti 
\\src3 -^ dst2 

src2 indirect (disp = 0, 1 , IRO, IR1 ) 

dsti register (Rnl, £ n1 ^7) 

src3 register (Rn2, ^ n2 ^ 7) 

dst2 indirect (disp = 0, 1, IRO, IR1 ) 



31 




24 23 




1615 




87 







1 
1 1 


1 1 1 1 
10 


1 1 
dsti 


1 1 



1 1 
src3 


1 1 1 1 1 1 1 
dst2 


1 1 1 1 1 1 1 
src2 



Description 



Cycles 
Status Bits 




A floating-point absolute value and a floating-point store are performed in 
parallel. All registers are read at the beginning and loaded at the end of the 
execute cycle. This means that if one of the parallel operations (STF) reads 
from a register and the operation being performed in parallel (ABSF) writes 
to the same register, then STF accepts as input the contents of the register 
before it is modified by the ABSF. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

An overflow occurs if src (man) = SOOOOOOOh and src (exp) = 7Fh. The 
result is dst (man) = 7FFFFFFFh and dst (exp) = 7Fh. 



1 

N 

Z 
V 

c 

UP 
LV 





1 if a zero result is generated, otherwise. 

1 if a floating-point overflow occurs, otherwise. 

Unaffected. 



1 if a floating-point overflow occurs, unchanged otherwise. 



LUF Unaffected. 



l\/lode Bit OVM Operation not affected by OVM. 
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Parallel ABSF and STF ABSFIISTF 



Example ABSF *++AR3 (IRl) ,R4 

I I STF R4,*-AR7(1) 

Before Instruction: 

AR3 = 809800h 

IRl = OAFh 

R4 = 733C00000h = 1.79750e+02 

AR7 = 8098C5h 

Data at 8098AFh = 58B4000h = -6.1 18750e+01 

Data at 8098C4h = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR3 = 8098AFh 

IRl = OAFh 

R4 = 574C00000h = 6.118750e+01 

AR7 = 8098C5h 

Data at 8098AFh = 58B4000h = -6.1 18750e+01 

Data at 8098C4h = 733C000h = 1 .79750e+02 

LUF LVUFNZVC = 0000000 
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ABSI 



Absolute Value of Integer 



Syntax 

Operation 

Operands 



ABSI <src>,<dst> 

I SAC I -♦ dst 

src general addressing modes (G): 
register (Rn, < n ^ 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n ^ 27) 



Encoding 



31 


24 23 






1615 


87 





1 1 



1 


1 
G 


1 1 1 1 
dst 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
src 



Description 



Cycles 
Status Bits 



The absolute value of the src operand is loaded into the dst register. The 
src and dst operands are assumed to be signed integers. 

An overflow occurs if src = SOOOOOOOh. If ST(OVM) = 1 , the result is dst 
= 7FFFFFFFh. If ST(OVM) = 0, the result is dst = OOOOOOOOh. 



1 

N 

Z 
V 

c 

UP 
LV 





1 if a zero result is generated, otherwise. 

1 if an integer overflow occurs, otherwise. 

Unaffected. 



1 if an integer overflow occurs, unchanged otherwise. 



/\/lode Bit 



LUF Unaffected. 

OVM Operation affected by OVM. 
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Absolute Value of Integer 



ABSi 



Example ABSI RO,RO 

or ABSI RO 

Before Instruction: 

RO = OFFFFFFCBh =-53 
After Instruction: 
RO = 035h = 53 

Example ABSI *AR1,R3 

Before Instruction: 



AR1 = 20h 

R3 - Oh 

Data at 20h = OFFFFFFCBh = -53 

After Instruction: 



AR1 - 20h 

R3 = 35h - 53 

Data at 20h = OFFFFFFCBh = -53 
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ABSillSTI 



Parallel ABSI and STI 



Syntax 

Operation 

Operands 

Encoding 



ABSI <src2>,<dst1> 
II STI <src3> ,<dst2> 

\src2\ -* dsti 
\\src3 -*■ dst2 

src2 Indirect (disp = 0, 1, I RO, I R1) 

dsti register (Rnl, < n1 < 7) 

src3 register (Rn2, < n2 ^ 7) 

dst2 indirect (disp = 0, 1, IRO, IR1) 



31 




24 23 




1615 




87 







1 
1 1 


1 1 1 1 
10 1 


1 1 
dsti 


1 1 



1 1 ' 
src3 


1 1 1 1 1 1 1 
dst2 


src2 



Description 



Cycles 
Status Bits 




Mode Bit 



An integer absolute value and an integer store are performed in parallel. 
All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STI) reads from a 
register and the operation being performed in parallel (ABSI) writes to the 
same register, then STI accepts as input the contents of the register before 
it is modified by the ABSI. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

An overflow occurs if src = SOOOOOOOh. If ST(OVM) = 1, the result is dst 
= 7FFFFFFFh. If ST(OVM) = 0, the result is dst = OOOOOOOOh. 



1 

N 

Z 
V 

c 

UP 
LV 





1 if a zero result is generated, otherwise. 

1 if an integer overflow occurs, otherwise. 

Unaffected. 



1 if an integer overflow occurs, unchanged otherwise. 



LUF Unaffected. 

OVM Operation affected by OVM. 
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Parallel ABSI and STI ABSI||STI 



Example ABSI *-AR5(l),R5 

I I STI R1,*AR2 — (IRl) 

Before Instruction: 

AR5 = 8099E2h 

R5 = Oh 

R1 = 42h = 66 

AR2 = 8098FFh 

IRl = OFh 

Data at 8099E1 h = OFFFFFFCBh = -53 

Data at 8098FFh = 2h = 2 

LUFLVUFNZVC^OOOOOOO 

After Instruction: 

AR5 = 8099E2h 

R5 = 35h = 53 

R1 = 42h = 66 

AR2 = 8098F0h 

IRl = OFh 

Data at 8099E1h = OFFFFFFCBh = -53 

Data at 8098FFh = 42h = 66 

LUFLVUFNZVC^OOOOOOO 
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ADDC 



Add Integer with Carry 



Syntax 

Operation 

Operands 



ADDC <src>,<dst> 

dst + src + C -* dst 

src general addressing modes (G): 
register (Rn, ^ n ^ 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n ^ 27) 



Encoding 



31 




24 23 






1615 


87 





1 1 



1 1 



1 1 1 
1 


1 
G 


1 1 1 1 
dst 


1 1 1 1 



1 II 

src 


1 1 1 



Description 



Cycles 
Status Bits 



IVIode Bit 
Example 




The sum of the dst and src operands and the C (carry) flag is loaded into 
the dst register. The dst and src operands are assumed to be signed inte- 
gers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a carry occurs, otherwise. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 



OVM Operation affected by OVM. 
ADDC R1,R5 
Before Instruction: 



R1 = 00FFFF5C25h = -41,947 
R5 = 00FFFF019Eh = -65,122 
LUF LVUFNZVC = 0000001 

After Instruction: 



R1 = 00FFFF5C25h = -41,947 
R5 = 00FFFE5DC4h = -107,068 
LUF LV UF N Z V C = 1 



1 
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Add Integer With Carry, 3-Operand 



ADDC3 



Syntax 

Operation 

Operands 



Encoding 



ADDC3 <src2>,<src1>,<dst> 

srcl + src2 + C -^ dst 

srcl three-operand addressing modes (T): 
register ( Rnl, < n1 <, 27) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 

1 register (Rnl, ^ n1 ^ 27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, £ n2 ^ 27) 

1 register (Rn2, ^ n2 ^ 27) 

1 indirect (disp = 0, 1, IRO, IR1) 
1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, ^ n ^ 27) 



31 


24 23 






1615 




87 







1 1 
1 


1 1 1 1 1 



1 
T 


1 1 1 1 
dst 


'1 1 1 1 1 1 
srcl 


1"" 1 1 1 I 1 1 
src2 



Description 



Cycles 
Status Bits 



The sum of the srcl and src2 operands and the C (carry) flag is loaded into 
the dst register. The srcl , src2, and dst operands are assumed to be signed 
integers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a carry occurs, otherwise. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 



Mode Bit 



OVM Operation affected by OVM. 
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ADDC3 



Add integer With Carry, 3-Operand 



Example ADDC3 *AR5++(IR0) ,R5 ,R2 

or 

ADDC3 R5 , *AR5++ ( IRO ) , R2 

Before Instruction: 

AR5 = 809908h 

IRO = lOh 

R5 = 066h = 102 

R2 = 0h 

Data at 809908h = OFFFFFFCBh = -53 

LUF LV UF N Z V C = 1 

After Instruction: 

AR5 = 80991 8h 

IRO = lOh 

R5 = 066h = 102 

R2 = 032h = 50 

Data at 809908h = OFFFFFFCBh = -53 

LUF LV UF N Z V C = 1 

Example ADDC3 R2 , R7, RO 

Before Instruction: 

R2 = 02BCh = 700 

R7 = 0F82h = 3970 

RO = Oh 

LUF LV UF N Z V C = 1 

After Instruction: 

R2 = 02BCh = 700 
R7 = 0F82h = 3970 
RO = 0123Fh = 4671 
LUFLVUFNZVC = 0000000 
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Add Floating-Point 



ADDF 



Syntax 

Operation 

Operands 



ADDF <src>,<dst> 

dst + src -* dst 

src general addressing modes (G): 
register ( Rn, ^ n ^ 7) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n ^ 7) 



Encoding 



31 


24 23 






1615 


87 








1 1 


G 


dst 


src 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



Tine sum of the dst and src operands is loaded into the dst register. The 
dst and src operands are assumed to be floating-point numbers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

OVM Operation not affected by OVM. 

ADDF *AR4++ ( IRl ) , R5 

Before Instruction: 

AR4 = 809800h 

IRl = 12Bh 

R5 = 0579800000h = 6.23750e+01 

Data at 80992Bh = 86B2800h = 4.7031 250e+02 

LUFLVUFNZVC = 0000000 

After Instruction: 




AR4 = 80992Bh 

IRl = 12Bh 

R5 = 09052C0000h = 5.3268750e+02 

Data at 80992Bh = 86B2800h = 4.7031 250e+02 

LUFLVUFNZVC = 0000000 
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ADDF3 



Add Floating-Point, 3-Operand 



Syntax 

Operation 

Operands 



Encoding 



ADDF3 <src2>,<src1>,<dst> 

srcl + src2 -^ dst 

srcl three-operand addressing modes (T): 
register ( Rnl, ^ n1 ^7) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 

1 register ( Rnl, ^ n1 :< 7) 
11 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, ^ n2 ^ 7) 
1 register (Rn2, :S n2 :^ 7) 

10 indirect (disp = 0, 1, IRO, IR1) 

11 indirect (disp - 0, 1, IRO, IR1) 

dst register (Rn, < n < 7) 



31 24 23 

— I — I — I — I — I — I — I — I — 
1 1 



1615 



87 



T — I — I — r 
dst 



T — I — I — I — I — r 
srcl 



I I I I I r 
src2 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 




The sum of the srcl and src2 operands is loaded into the dst register. The 
srcl , src2, and dst operands are assumed to be floating-point numbers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

OVM Operation not affected by OVM. 

ADDF3 R6,R5,R1 

or 

ADDF3 R5,R6,R1 

Before Instruction: 



R6 = 086B280000h = 4.7031 250e+02 

R5 = 0579800000h = 6.23750e+01 

R1 = Oh 

LUFLVUFNZVC = 0000000 

After instruction: 

R6 = 086B280000h = 4.7031 250e+02 
R5 = 0579800000h = 6.23750e+01 
R1 = 09052C0000h = 5.3268750e+02 
LUF LVUFNZVC = 0000000 
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Add Floating-Point. 3-Operand ADDF3 

Example ADDF3 *+ARl( 1) ,*AR7++(IR0) ,R4 

Before Instruction: 

AR1 = 809820h 

AR7 = 8099F0h 

IRO = 8h 

R4 = Oh 

Data at 809821 h = 700F000h = 1.28940e+02 

Data at 8099F0h = 34C2000h = 1 .27590e+01 

LUFLVUFNZVC = 0000000 

After instruction: 

AR1 = 809820h 

AR7 = 8099F8h 

IRO = 8h 

R4 = 070DB20000h = 1.41 69531 3e+02 

Data at 809821 h = 700F000h = 1 .28940e+02 

Data at 8099F0h = 34C2000h = 1 .27590e+01 

LUF LVUFNZVC^OOOOOOO 
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ADDF3IISTF 



Parallel ADDF3 and STF 



Syntax 

Operation 

Operands 

Encoding 



ADDF3 <src2>,<src1>,<dst1> 
II STF <src3> ,<dst2> 



srcl + src2 
src3 -* dstl 



dsti 



srcl register (Rnl, ^ n1 ^7) 

src2 indirect (disp = 0, 1, IRO, IR1) 

dstl register (Rn2, £ n2 ^ 7) 

src3 register (Rn3, :S n3 ^ 7) 

dst2 indirect (disp = 0, 1, IRO, IR1) 



31 




24 23 




1615 




87 







1 1 


110 


1 1 
dstl 


1 1 
srcl 


1 1 
src3 


1 1 1 1 1 1 1 
dst2 


' 1 1 1 1 1 1 1 
src2 



Description 



Cycles 
Status Bits 



A floating-point addition and a floating-point store are performed in paral- 
lel. All registers are read at the beginning and loaded at the end of the ex- 
ecute cycle. This means that if one of the parallel operations (STF) reads 
from a register and the operation being performed in parallel (ADDF3) 
writes to the same register, then STF accepts as input the contents of the 
register before it is modified by the ADDF3. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 



Mode Bit 



OVM Operation not affected by OVM. 
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Parallel ADDF3 and STF ADDF3IISTF 



Example ADDF3 *+AR3 (IRl) ,R2 ,R5 

I I STF R4,*AR2 

Before Instruction: 

AR3 = 809800h 

IRl = 0A5h 

R2 = 070C800000h = 1 .4050e+02 

R5 = Oh 

R4 = 057B400000h = 6.281 250e+01 

AR2 = 8098F3h 

Data at 8098A5h = 733C000h = 1 .79750e+02 

Data at 8098F3h = Oh 

LUF LVUFNZVC = 0000000 

After Instruction: 

AR3 = 809800h 

IRl = 0A5h 

R2 = 070C800000h = 1 .4050e+02 

R5 = 0820200000h = 3.20250e+02 

R4 = 057B400000h = 6.281 250e+01 

AR2 = 8098F3h 

Data at 8098A5h = 733C000h = 1 .79750e+02 

Data at 8098F3h = 57B4000h = 6.281 25e+01 

LUFLVUFNZVC = 0000000 
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ADDI 



Add Integer 



Syntax 

Operation 

Operands 



ADDI <src>,<dst> 

dst + src -» dst 

src general addressing modes (G): 
register (Rn, ^ n ^ 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, < n :S 27) 



Encoding 
31 



24 23 



1615 



87 



— I — I — I — I — I — I — I — I — 
0000001 00 



dst 



Description 



Cycles 
Status Bits 



l\Aode Bit 



Example 




The sum of the dst and src operands is loaded into the the dst register. The 
dst and src operands are assumed to be signed integers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a carry occurs, otherwise. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

OVM Operation affected by OVM. 

ADDI R3,R7 

Before Instruction: 

R3 = OFFFFFFCBh = -53 

R7 = 35h = 53 

LUFLVUFNZVC = 0000000 

After Instruction: 

R3 = OFFFFFFCBh = -53 

R7 = Oh 

LUF LV UF N Z V C = 1 
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Add Integer, 3-Operand 



ADDIS 



Syntax 

Operation 

Operands 



Encoding 



ADDIS <src2>,<src1>,<dst> 

srcl + src2 -» dst 

src1 three-operand addressing modes (T): 
register (Rnl, £ n1 < 27) 

1 indirect (disp = 0, 1, IRO, IR1 ) 

1 register (Rnl, ^ n1 ^ 27) 
11 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, ^ n2 :S 27) 

1 register (Rn2, :S n2 ^ 27) 

1 indirect (disp = 0, 1, IRO, IR1 ) 
1 1 indirect (disp - 0, 1, IRO, IR1) 

dst register (Rn, ^ n £ 27) 



31 


24 23 






1615 




87 







1 


1 


T 


dst 


srcl 


src2 



Description 

Cycles 
Status Bits 



The sum of the srcl and src2 operands is loaded into the dst register. The 
srcl , src2, and dst operands are assumed to be signed integers. 



1 

N 

Z 
V 

c 

UF 
LV 

LUF Unaffected. 



1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 

1 if an integer overflow occurs, otherwise. 

1 if a carry occurs, otherwise. 



1 if an integer overflow occurs, unchanged otherwise. 



l\^ode Bit 



OVM Operation affected by OVM. 
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ADDIS 



Add Integer, 3-Operand 



Example 



Example 




ADDI3 R4,R7,R5 
Before Instruction: 

R4 = ODCh = 220 

R7 = OAOh = 1 60 

R5 = lOh = 16 

LUFLVUFNZVC = 0000000 

After Instruction: 

R4 = ODCh = 220 
R7 = OAOh = 1 60 
R5 = 017Ch = 380 
LUFLVUFNZVC = 0000000 

ADDIS *-AR3+(l) ,*AR6 — (IR0),R2 
Before Instruction: 

AR3 = 809802h 

AR6 = 809930h 

IRO = 18h 

R2 = lOh = 16 

Data at 809801 h = 2AF8h = 1 1 ,000 

Data at 809930h = 3A98h = 1 5,000 

LUFLVUFNZVC = 0000000 

After Instruction: 



AR3 = 809852h 

AR6 = 80991 8h 

IRO = 18h 

R2 = 06598h = 26,000 

Data at 809801 h = 2AF8h = 1 1 ,000 

Data at 809930h = 3A98h = 1 5,000 

LUFLVUFNZVC = 0000000 
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Parallel ADDI3 and STI 



ADDI3IISTI 



Syntax 



Operation 



Operands 



Encoding 



ADDIS <src2>,<src1><dst1> 
STI <src3>,<dst2> 



srcl + src2 
\\src3 ->■ dst2 



dsti 



srcl register (Rnl, ^ n1 £ 7) 

src2 indirect (disp = 0, 1, I RO, I R1) 

dsti register (Rn2, ^ n2 ^ 7) 

src3 register (Rn3, ^ n3 ^ 7) 

dst2 indirect (disp = 0, 1, IRO, IR1) 



31 




24 23 




1615 




87 







1 1 


111 


dsti 


srcl 


src3 


I 1 1 1 1 1 1 
dst2 


src2 



Description 



Cycles 
Status Bits 



l\/lode Bit 
Example 



An integer addition and an integer store are performed in parallel. All reg- 
isters are read at the beginning and loaded at the end of the execute cycle. 
This means that if one of the parallel operations (STI) reads from a register 
and the operation being performed in parallel (ADDIS) writes to the same 
register, then STI accepts as input the contents of the register before it is 
modified by the ADDIS. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 



1 

N 

Z 
V 

c 

UF 
LV 

LUF Unaffected. 



1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 

1 if an integer overflow occurs, otherwise. 

1 if a carry occurs, otherwise. 



1 if an integer overflow occurs, unchanged otherwise. 



OVM Operation affected by OVM. 

ADDI3 *ARO — (IR0),R5,R0 
I I STI R3,*AR7 

Before Instruction: 




ARO = 80992Ch 

IRO = OCh 

R5 = ODCh = 220 

RO = Oh 

R3 = 35h = 5S 

AR7 = 8098SBh 

Data at 80992Ch = 

Data at 80983Bh - 

LUF LV UF N Z 



12Ch = SOO 

Oh 

VC-OOOOOOO 
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ADDI3IISTI Parallel ADDI3 and STI 



After Instruction: 

ARO = 809920h 

IRO = OCh 

R5 = ODCh = 220 

RO = 208h = 520 

R3 = 35h = 53 

AR7 = 80983Bh 

Data at 80992Ch = 12Ch = 300 

Data at 80983Bh = 35h = 53 

LUFLVUFNZVC = 0000000 
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Bit>A/ise Logical-AND 



AND 



Syntax 

Operation 

Operands 



AND <src>,<cfst> 

dst AND src -* dst 

src general addressing modes (G): 
register (Rn, ^ n £ 27) 

1 direct 

1 indirect 

1 1 immediate (not sign-extended) 

dst register (Rn, ^ n ^ 27) 



Encoding 



31 


24 23 






1615 


87 








1 1 


G 


dst 


src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The bitwise logical-AND between the dst and src operands is loaded into 
the dst register. The dst and src operands are assumed to be unsigned in- 
tegers. 

1 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 

C Unaffected. 

UF 

LV Unaffected. 

LUF Unaffected. 

OVM Operation not affected by OVM. 

AND R1,R2 

Before Instruction: 



R1 = 80h 
R2 = OAFFh 
LUF LV UF N Z 

After Instruction: 



VC = 0000001 



R1 = 80h 
R2 = 80h 
LUF LV UF N Z V C = 1 
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AND3 



Bitwise Logical-AND, 3-Operand 



Syntax 

Operation 

Operands 



AND <src2>,<src1>,<dst> 

srcl AND src2 -* dst 

srcl three-operand addressing modes (T): 
register (Rnl, ^ n1 ^ 27) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 

1 register (Rnl, :S n1 ^ 27) 

1 1 indirect (disp = 0, 1 , IRO, IR1 ) 

src2 three-operand addressing modes (T): 
register (Rn2, ^ n2 ^ 27) 

1 register (Rn2, ^ n2 ^ 27) 

1 indirect (disp = 0, 1, IRO, IR1) 
1 1 indirect (disp = 0, 1 '"" '""^ 

dst register (Rn, ^ n ^ 27) 



IRO, IR1) 



Encoding 
31 



24 23 



1615 



— r— T — 1 — I — 1 — 
1 1 



T — I — r 
dst 



T — I — I — I — I — r 
srcl 



87 

T — I — I — I — I — I — r 
src2 



1 



Description 



Cycles 
Status Bits 



The bitwise logical-AND between the srcl and srcl operands is loaded into 
the dst register. The srcl , srcl, and dst operands are assumed to be un- 
signed integers. 



1 

N 

Z 
V 

c 

UF 
LV 



MSB of the output. 

1 if a zero output is generated, otherwise. 



Unaffected. 



Unaffected. 



Mode Bit 



LUF Unaffected. 

OVM Operation not affected by OVM. 
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Bit\A/ise Logical-AND, 3-Operand 



AND3 



Example 



Example 



AND3 *ARO — (IRO) ,*+ARl,R4 
Before Instruction: 

ARO = 8098F4h 

IRO = 50h 

AR1 = 809951 h 

R4 = Oh 

Data at 8098A4h = 30h 

Data at 809952h = 123h 

LUFLVUFNZVC = 0000000 

After Instruction: 



ARO = 8098A4h 

IRO = 50h 

AR1 = 809951 h 

R4 = 020h 

Data at 8098A4h = 30h 

Data at 809952h = 123h 

LUF LVUFNZVC = 0000000 

AND3 *-AR5,R7,R4 
Before Instruction: 

AR5 = 80985Ch 

R7 = 2h 

R4 = Oh 

Data at 80985Bh = OAFFh 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR5 = 80985Ch 

R7 = 2h 

R4 = 2h 

Data at 80985Bh = OAFFh 

LUFLVUFNZVC = 0000000 
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AND3I1STI 



Parallel AND3 and STI 



Syntax 

Operation 

Operands 



Encoding 
31 



AND <src2>,<src1>,<dst1> 
II STI <src3> ,<dst2> 



src 7 AND src2 
src3 ->■ dst2 



dsti 



srcl register (Rnl, :S n1 ^7) 

src2 indirect (disp = 0, 1, IRO, 1R1) 

dsti register (Rn2, < n2 ^ 7) 

src3 register (Rn3, :^ n3 ^ 7) 

dst2 indirect (disp = 0, 1 , IRO, IR1 ) 



24 23 

T — I — I — I — I — I — r 
110 10 dsti 



1615 



87 



— I — r 
srcl 



T — 1 — r 
dst2 



src3 



src2 



Description 



Cycles 
Status Bits 



Mode Bit 



A bitwise logical-AND and an integer store are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STI) reads from a 
register and the operation being performed in parallel (AND3) writes to the 
same register, then STI accepts as input the contents of the register before 
it is modified by the AND3. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 



1 

N 

Z 
V 

c 

UF 
LV 

LUF Unaffected. 



MSB of the output. 

1 if a zero output is generated, otherwise. 



Unaffected. 



Unaffected. 



OVM Operation not affected by OVM. 
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Parallel AND3 and STI AND3||STI 



Example AND3 *+ARl(IRO) ,R4 ,R7 

I I STI R3,*AR2 

Before Instruction: 

AR1 = 8099F1h 

IRO = 8h 

R4 = 0A323h 

R7 = Oh 

R3 = 35h = 53 

AR2 = 80983Fh 

Data at 8099F9h = 5C53h 

Data at 80983Fh = Oh 

LUF LVUFNZVC = 0000000 

After Instruction: 



AR1 = 8099F1 h 

IRO = 8h 

R4 = 0A323h 

R7 = 03h 

R3 = 35h = 53 

AR2 = 80983Fh 

Data at 8099F9h = 5C53h 

Data at 80983Fh = 35h = 53 

LUFLVUFNZVC = 0000000 
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ANDN 



Bit\A/ise Logical-AND vj'ith Complement 



Syntax 

Operation 

Operands 



ANDN <src>.<dst> 
dst AND ~src -* dst 

src general addressing modes (G): 

register (Rn, < n ^ 27) 

1 direct 

1 indirect 

1 1 immediate (not sign-extended) 

dst register (Rn, ^ n £ 27) 



Encoding 
31 



24 23 



1615 



87 



110 



dst 



Description The bitwise logical-AND between the dst operand and the bitwise logical 
complement (~) of the src operand is loaded into the dst register. The dst 
and src operands are assumed to be unsigned integers. 

Cycles 1 



Status Bits 


N 

Z 
V 

c 

UF 
LV 
LUF 


MSB of the output. 

1 if a zero output is generated, otherwise 



Unaffected. 



Unaffected. 

Unaffected. 


IVIode Bit 


OVM Operation not affected by OVM. 


Example 


ANDN 


@980Ch,R2 




Befo 


re Instruction: 




DP = 80h 

R2 = 0C2Fh 

Data at 80980Ch = 0A02h 

LUFLVUFNZVC = 0000000 

After Instruction: 

DP = 80h 

R2 = 04^ h 

Data at 80980Ch = 0A02h 

LUFLVUFNZVC = 0000000 
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Bitwise Logical -AIM DN, 3-Operand 



ANDIM3 



Syntax 

Operation 

Operands 



Encoding 

31 



AN DNS <src2>,<src1>,<dst> 

srcl AND ~src2 -* dst 

srcl three-operand addressing modes (T): 
register (Rnl, ^ n1 i^ 27) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 

1 register (Rnl, ^ n1 ^ 27) 

1 1 indirect (disp = 0, 1 , IRO, IR1 ) 

src2 three-operand addressing modes (T): 
register (Rn2, £ n2 ^ 27) 

1 register (Rn2, £ n2 ^ 27) 
10 indirect (disp = 0, 1, IRO, IR1) 

1 1 indirect (disp = 0, 1, 100, IR1) 

dst register (Rn, ^ n :S 27) 



24 23 



1 — I — I — r 



T — I — I — r 
dst 



1615 87 

T — I — I — I — I — I — r 



T — 1 — I — I — I — I — r 
src2 



10 10 



srcl 



Description The bitwise logical-AND between the srcl operand and the bitwise logical 
complement (~) of the src2 operand is loaded into the dst register. The 
srcl , src2, and dst operands are assumed to be unsigned integers. 

Cycles 1 



Status Bits 


N 


MSB of the output. 




Z 


1 if a zero output is generated, otherwise 




V 







c 


Unaffected. 




UF 







LV 


Unaffected. 




LUF 


Unaffected. 


Mode Bit 


OVM 


Operation not affected by OVM. 



Example 



ANDN3 R5,R3,R7 

Before Instruction: 

R5 = 0A02h 

R3 = 0C2Fh 

R7 = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

R5 = 0A02h 
R3 = 0C2Fh 
R7 = 042Dh 
LUFLVUFNZVC = 0000000 
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ANDN3 Bitwise Logical-ANDN, 3-Operand 

Example ANDN3 Rl, *AR5++(IR0) ,R0 

Before Instruction: 

Rl = OCFh 

AR5 = 809825h 

IRO = 5h 

RO = Oh 

Data at 809825h = OFFFh 

LUFLVUFNZVC = 0000000 

After Instruction: 




Rl = OCFh 

AR5 = 80982Ah 

IRO = 5h 

RO = 0F30h 

Data at 809825h = OFFFh 

LUFLVUFNZVC-0000000 
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Arithmetic Shift 



ASH 



Syntax 
Operation 



Operands 



Encoding 



ASH <count>,<dst> 

If {count >_Q): 

dst < < count -*■ dst 
Else: 

dst >> \count\ -* dst 

count general addressing modes (G): 
register (Rn, ^ n ^ 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (RN, ^ n ^ 27) 



31 


24 23 






1615 


87 








111 


G 


dst 


count 



Description 



Cycles 
Status Bits 



The seven least-significant bits of the count operand are used to generate 
the two's-complement shift count of up to 32 bits. 

If the count operand is greater than zero, the dst operand is left-shifted by 
the value of the count operand. Low-order bits shifted in are zero-filled, 
and high-order bits are shifted out through the C (carry) bit. 

Arithmetic left-shift: 
C- dst-^ 

If the count operand is less than zero, the dst operand is right-shifted by the 
absolute value of the count operand. The high-order bits of the dst operand 
are sign-extended as it is right-shifted. Low-order bits are shifted out 
through the C (carry) bit. 

Arithmetic right-shift: 

-» sign of dst -* C 

If the count operand is zero, no shift is performed, and the C (carry) bit is 
set to 0. The count and dst operands are assumed to be signed integers. 



N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C Set to the value of the last bit shifted out. for a shift count of 0. 

Unaffected if dst is not RO - R7. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 




Mode Bit 



LUF Unaffected. 

OVM Operation not affected by OVM. 
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ASH 



Arithmetic Shift 



Example 



Example 




ASH R1,R3 
Before Instruction: 

R1 = lOh = 16 
R3 = OAEOOOh 
LUFLVUFNZVC = 0000000 

After Instruction: 



R1 = lOh 

R3 = OEOOOOOOOOh 

LUF LV UF N Z V C = 1 1 1 

ASH @98C3h,R5 
Before Instruction: 



DP = 80h 

R5 = OAEC00001 h 

Data at 8098C3h = 0FFE8 = -24 

LUFLVUFNZVC = 0000000 

After Instruction: 

DP = 80h 

R5 = OFFFFFFAEh 

Data at 8098C3h = 0FFE8 = -24 

LUF LVUFNZVC = 0001 001 
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Arithmetic Shift, 3-Operand 



ASH3 



Syntax ASH3 <count> <src> ,<dst> 

Operation \i {count >_0): 

src < < count -*■ dst 
Else: 
src >> \count\ -* dst 

Operands count three-operand addressing modes (T): 

register (Rnl, < n1 ^ 27) 

1 direct (disp = 0,1, IRO, IR1) 

1 register (Rnl, ^ n1 ^ 27) 
11 indirect (disp = 0, 1, IRO, IR1) 

src three-operand addressing modes (T): 
register (Rn2, ^ n2 ^ 27) 

1 register (Rn2, ^ n2 ^ 27) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 
1 1 indirect (disp = 0, 1, lOO, IR1) 

dst register (Rn, ^ n £ 27) 



Encoding 
31 



24 23 

— I — I — I — I — I — I — I — I — 
10 10 1 



1615 



87 



T — r 
dst 



T — I — r 
src 



T — I — I — r 
count 



Description The seven least-significant bits of the count operand are used to generate 
the two's-complement shift count of up to 32 bits. 

If the count operand is greater than zero, the src operand is left-shifted by 
the value of the count operand. Low-order bits shifted in are zero-filled, 
and high-order bits are shifted out through the C (carry) bit. 

Arithmetic left-shift: 
C*- src*- 

If the count operand is less than zero, the src operand is right-shifted by the 
absolute value of the count operand. The high-order bits of the src operand 
are sign-extended as it is right-shifted. Low-order bits are shifted out 
through the C (carry) bit. 

Arithmetic right-shift: 

-♦ sign of dst -*■ C 

If the count operand is zero, no shift is performed, and the C (carry) bit is 
set to 0. The count, src, and dst operands are assumed to be signed inte- 
gers. 

Cycles 1 
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ASH3 



Arithmetic Shift, 3-Operand 



Status Bits 



Mode Bit 
Example 



N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C Set to the value of the last bit shifted out. for a shift count of 0. 

Unaffected if dst is not RO - R7. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

OVM Operation not affected by OVM. 

ASH3 *AR3 — ( 1) ,R5 ,R0 

Before Instruction: 

AR3 = 809921 h 

R5 = 02B0h 

RO = Oh 

Data at 809921 h = 1 0h = 1 6 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR3 = 809920h 

R5 = 000002B0h 

RO = 02B00000h 

Data at 809921 h = lOh = 16 

LUF LVUFNZVC = 0000000 



Example 




ASH3 R1,R3,R5 

Before Instruction: 

R1 = OFFFFFFFSh = -8 

R3 = OFFFFCBOOh 

R5 = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

R1 = 0FFFFFFF8h = -8 

R3 = OFFFFCBOOh 

R5 = OFFFFFFCBh 

LUF LV UF N Z V C = 1 



11-50 



Parallel ASH3 and STI 



ASH3IISTI 



Syntax 
Operation 

Operands 

Encoding 



ASH 3 <count>,<src2>,<dst1> 
II STI <src3> ,<dst2> 

If {count > 0): 

src2 << count -* dsti 
Else: 

src2 >> \count\ -* dsti 
1 1 src3 -+ dst2 

count register (Rnl, ^ n1 :S 7) 
src2 indirect (disp = 0, 1, I RO, I R1) 
dst7 register (Rn2, ^ n2 ^ 7) 
src3 register (Rn3, ^ nS ^ 7) 
dst2 indirect (disp = 0, 1, IRO, IR1 ) 



31 




24 23 




1615 




87 







1 1 


10 1 


dsti 


count 


src3 


1 1 1 1 1 1 1 
dst2 


1 1 1 1 1 1 1 
src2 



Description 



The seven least-significant bits of the count operand register are used to 
generate the two's-complement shift count of up to 32 bits. 

If the count operand is greater than zero, the dst operand is left-shifted by 
the value of the count operand. Low-order bits shifted in are zero-filled, 
and high-order bits are shifted out through the C (carry) bit. 

Arithmetic left-shift: 
C ^ src2 *- 

If the count operand is less than zero, the dst operand is right-shifted by the 
absolute value of the count operand. The high-order bits of the dst operand 
are sign-extended as it is right-shifted. Low-order bits are shifted out 
through the C (carry) bit. 

Arithmetic right-shift: 

->■ sign of src2 -* C 

If the count operand is zero, no shift is performed, and the C (carry) bit is 
set to 0. The count and dst operands are assumed to be signed integers. 

All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STI) reads from a 
register and the operation being performed in parallel (ASH3) writes to the 
same register, then STI accepts as input the contents of the register before 
it is modified by the ASH3. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 




Cycles 
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ASH3IISTI 



Parallel ASH3 and STI 



Status Bits 



Mode Bit 
Example 




N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C Set to the value of the last bit shifted out. for a shift count of 0. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

OVM Operation not affected by OVM. 

ASH3 R1,*AR6++(IR1) ,R0 
I I STI R5,*AR2 

Before Instruction: 

AR6 = 809900h 

IR1 = 8Ch 

R1 = OFFESh = -24 

RO = 0h 

R5 = 35h = 53 

AR2 = 8098A2h 

Data at 809900h = OAEOOOOOOh 

Data at 8098A2h = Oh 

LUFLVUFNZVC=0000000 

After Instruction: 



AR6 = 80998Ch 

IR1 = 8Ch 

R1 = OFFESh = -24 

RO = OFFFFFFAEh 

R5 = 35h = 53 

AR2 = 8098A2h 

Data at 809900h = OAEOOOOOOh 

Data at 8098A2h = 35h = 53 

LUF LV UF N Z V C = 1 
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Branch Conditionally (Standard) 



Bcond 



Syntax 
Operation 



Operands 

Encoding 
31 



Bcond <src> 

If cond is true: 
\i src is in register addressing mode (Rn 0^n^27), 

src -» PC. 
If src is in PC-relative mode (label or address), 
displacement + PC + 1 -> PC. 
Else, continue. 

src conditional-branch addressing modes (B): 

register 

1 PC-relative 



24 23 



1615 



87 



\ I I I r I I I r 
01 1 01 OBOOOO 



— 1 — I — r 
cond 



-1 — I — I — I — I — I — I — I — 1 — I — r 
register or displacement 



Description 



Cycles 



Bcond signifies a standard branch that executes in four cycles. A branch 
is performed if the condition is true. If the src operand is expressed in reg- 
ister addressing mode, the contents of the specified register are loaded into 
the PC. If the src operand is expressed in PC-relative mode, the assembler 
generates a displacement: displacement = label - (PC of branch instruction 
+ 1). This displacement is stored as a 16 bit signed integer in the 16 least 
significant bits of the branch instruction word. This displacement is added 
to the PC of the branch instruction plus 1 to generate the new PC. 

The TI\/1S320C30 provides 20 condition codes that can be used with this 
instruction (see Section 1 1 .2 for a list of condition mnemonics, encoding, 
and flags). 



Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


Mode Bit 


OVM 


Operation not a 


Example 


BZ 


RO 




Befo 


re Instruction: 




PC = 2B00h 

RO = 0003FF00h 

LUFLVUFNZVC-0000000 

After Instruction: 



PC = SFFOOh 

RO = OOOSFFOOh 

LUFLVUFNZVC = 0000000 
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BcondD 



Branch Conditionally (Delayed) 



Syntax 
Operation 



Operands 

Encoding 
31 



BcondD <src> 

If cond is true: 
If src is in register addressing mode (Rn 0^n<27), 

src -* PC. 
If src is in PC-relative mode (label or address), 
displacement + PC + 3 -» PC. 
Else, continue. 

src conditional-branch addressing modes (B): 

register 

1 PC-relative 



24 23 



1615 



87 



I I I I I I I I r 
1 1 1 B 1 



T — I — r— 
cond 



T 1 I I i I I I I I I I T 

register or displacement 



Description BcondD signifies a delayed branch that allows the three instructions after 
the delayed branch to be fetched before the PC is modified. The effect is a 
single-cycle branch. 

A branch is performed if the condition is true. If the src operand is ex- 
pressed in register addressing mode, the contents of the specified register 
are loaded into the PC. If the src operand is expressed in PC-relative mode, 
the assembler generates a displacement: displacement = label - (PC of 
branch instruction + 3). This displacement is stored as a 16 bit signed in- 
teger in the 16 least significant bits of the branch instruction. This dis- 
placement is added to the PC of the branch instruction plus 3 to generate' 
the new PC. The TMS320C30 provides 20 condition codes that can be 
used with this instruction (see Section 1 1 .2 for a list of condition mne- 
monics, encoding, and flags). 

Cycles 1 




Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


l\/lode Bit 


OVM 


Operation not affe 


Example 


BNZD 


36 (36 = 24h) 




Before Instruction: 




PC = 


50h 




LUF 


LV UF N Z V C 





After instruction: 

PC = 77h 

LUFLVUFNZVC = 0000000 
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Branch Unconditionally (Standard) 



BR 



Syntax 

Operation 

Operands 

Encoding 

31 



BR <src> 

src -* PC 

src long-immediate addressing mode 



87 



! 1 1 1 1 1 

110 





T 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



BR signifies a standard branch that executes in four cycles. An uncondi- 
tional branch is performed. The src operand is assumed to be a 24-bit un- 
signed integer. Note that bit 24 = for a standard branch. 



N 

Z 
V 

c 

UF 
LV 



Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 



LUF Unaffected. 

OVM Operation not affected by OVM. 

BR 805Ch 

Before Instruction: 



PC = 80h 

LUF LV UF N Z V C 

After Instruction: 

PC - 805Ch 

LUF LV UF N Z V C 
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BRD 



Branch Unconditionally (Delayed) 



Syntax 

Operation 

Operands 

Encoding 

31 



BRD <src> 

src -* PC 

src long-immediate addressing mode 



24 23 



1615 



87 



1 1 1 r 1 1 — 

110 


1 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

src 



Description BRD signifies a delayed branch that allows the three instructions after the 
delayed branch to be fetched before the PC is modified. The effect is a 
single-cycle branch. 

An unconditional branch is performed. The src operand is assumed to be a 
24-bit unsigned integer. Note that bit 24 = 1 for a delayed branch. 



Cycles 


1 


Status Bits 


N Unaffected. 




Z Unaffected. 




V Unaffected. 




C Unaffected. 




UF Unaffected. 




LV Unaffected. 




LUF Unaffected. 


Mode Bit 


OVM Operation not affected by OVM 


Example 


BRD 2Ch 




Before Instruction: 




PC = IBh 

LUFLVUFNZVC = 0000000 

After Instruction: 



PC - 2Ch 

LUF LVUFNZVC = 0000000 
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Call Subroutine 



CALL 



Syntax 
Operation 

Operands 
Encoding 
31 



CALL <src> 

Next PC -» • ++$? 
src -*■ PC 

src long-immediate addressing mode 



24 23 



1615 



87 



1 1 1 1 1 1 

110 1 





1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

src 



Description A call is performed. The next PC value is pushed onto the system stack. The 
src operand is loaded into the PC. The src operand is assumed to be a 
24-bit unsigned immediate operand. 



Cycles 


4 


Status Bits 


N Unaffected. 




Z Unaffected. 




V Unaffected. 




C Unaffected. 




UF Unaffected. 




LV Unaffected. 




LUF Unaffected. 


Mode Bit 


OVM Operation not a 


Example 


CALL 123456h 




Before Instruction: 



PC = 5h 

SP = 809801 h 

LUFLVUFNZVC = 0000000 

After Instruction: 

PC = 123456h 

SP = 809802h 

Data at 809802h = 6h 

LUFLVUFNZVC = 0000000 
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CALLcond 



Call Subroutine Conditionally 



Syntax 
Operation 



Operands 

Encoding 
31 



CALLcond <src> 

If cond is true: 
Next PC - ' ++5? 
If SAC is in register addressing mode (Rn 0<n^27), 

src -^ PC. 
If src is in PC-relative mode (label or address), 
displacement + PC + 1 -» PC. 
Else, continue. 

src conditional-branch addressing modes (B): 

register 

1 PC-relative 



24 23 



1615 



87 



— I 1 — I — I — I — I — I — I — I — 

011100B0000 



T — I — I — r 
cond 



T — I — I — I — I — I — I — I — : — I — I — I — I — I — r 
register or displacement 



Description 



Cycles 




A call is performed if the condition is true. If the condition is true, the next 
PC value is pushed onto the system stack. If the src operand is expressed 
in register addressing mode, the contents of the specified register are loaded 
into the PC. If the src operand is expressed in PC- relative mode, the as- 
sembler generates a displacement: displacement = label - (PC of call in- 
struction + 1 ). This displacement is stored as a 1 6-bit signed integer in the 
1 6 least significant bit of the call instruction word. This displacement is 
added to the PC of the call instruction plus 1 to generate the new PC. 

The TMS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 11.2 for a list of condition mnemonics, encoding, 
and flags). 



Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


l\/lode Bit 


OVM 


Operation n 
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Call Subroutine Conditionally CALLcond 

Example CALLNZ R5 

Before Instruction: 



PC = 123h 

SP = 809835h 

R5 = 789h 

LUFLVUFNZVC = 0000000 

After Instruction: 

PC = 789h 

SP = 809836h 

R5 = 789h 

Data at 809836h = 124h 

LUFLVUFNZVC = 0000000 
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CMPF 



Compare Floating-Point 



Syntax 

Operation 

Operands 



CMPF <src><dst> 
dst - src 

src general addressing modes (G): 

register (Rn, < n < 7) 

1 direct 

1 indirect 
1 1 immediate 

dst register (Rn, ^ n ^ 7) 



Encoding 



31 


24 23 






1615 


87 





1 1 



10 


G 


dst 


src 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 




The src operand is subtracted from the dst operand. The result is not loaded 
into any register, thus allowing for nondestructive compares. The dst and 
src operands are assumed to be floating-point numbers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

OVM Operation not affected by OVM. 

CMPF *+AR4,R6 

Before Instruction: 

AR4 = 8098F2h 

R6 = 070C800000h = 1.4050e+02 

Data at 8098F3h = 070C8000h - 1 .4050e+02 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR4 = 8098F2h 

R6 = 070C800000h = 1 .4050e+02 

Data at 8098F3h = 070C8000h = 1.4050e+02 

LUF LV UF N Z V C = 1 
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Compare Floating-Point, 3-Operand 



CMPF3 



Syntax 

Operation 

Operands 



CMPF3 <src2>,<src1> 

srcl - src2 

srcl three-operand addressing modes (T): 
register (Rnl, ^ n1 < 7) 

1 indirect (disp = 0, 1, IRO, IR1) 

1 register (Rnl, ^ n1 ^7) 
11 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, ^ n2 ^ 7) 

1 register (Rn2, < n2 ^ 7) 

1 indirect (disp = 0, 1, IRO, IRI) 
1 1 indirect (disp = 0, 1, IRO, IRI) 



Encoding 



31 


24 23 




1615 




87 







1 


110 


T 





1 1 1 1 1 1 1 
srcl 


src2 



Description 



Cycles 
Status Bits 



IVIode Bit 
Example 



The src2 operand is subtracted from the srcl operand. The result is not 
loaded into any register, thus allowing for nondestructive compares. The 
srcl and src2 operands are assumed to be floating-point numbers. Al- 
though this instruction has only two operands, it is designated as a three 
operand instruction since operands are specified in the three operand for- 
mat. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, otherwise. 

LV 1 if a flaoting-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 



OVM Operation not affected by OVM. 
CMPF3 *AR2,*AR3 — (1) 
Before Instruction: 




AR2 = 809831 h 
AR3 = 809852h 
Data at 809831 h = 
Data at 809852h = 
LUF LV UF N Z 

After Instruction: 



77A7000h = 2.5044e+02 
57A2000h = 6.2531 25e+01 
VC = 0000000 



AR2 = 809831 h 

AR3 = 809851 h 

Data at 809831 h = 77A7000h = 2.5044e+02 

Data at 809852h = 57A2000h = 6.2531 25e+01 

LUF LV UF N Z V C = 1 
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CMPI 



Compare Integer 



Syntax 

Operation 

Operands 



CMPI <src>,<dst> 

dst - src 

src general addressing modes (G): 
register (Rn, < n < 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n < 27) 



Encoding 
31 



24 23 



1615 



87 



— I — I — I — I — I — I — I — I — 
1 1 



dst 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 




The src operand is subtracted from the dst operand. The result is not loaded 
into any register, thus allowing for nondestructive compares. The dst and 
src operands are assumed to be signed integers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 



OVM Operation not affected by OVM. 
CMPI R3,R7 
Before Instruction: 



R3 = 898h = 2200 
R7 = 3E8h = 1000 
LUF LVUFNZVC = 0000000 

After Instruction: 

R3 = 898h = 2200 
R7 = 3E8h = 1000 
LUF LV UF N Z V C = 1 
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Compare Integer, 3-Operand 



CMPI3 



Syntax 

Operation 

Operands 



CMPI3 <src2>,<src1> 

srcl - src2 

srcl three-operand addressing modes (T): 
register (Rnl, ^ n1 < 27) 

1 indirect (disp = 0, 1, IRO, IR1) 

1 register (Rnl, ^ n1 ^ 27) 
11 indirect (disp = 0, IJRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, ^ n2 ^ 27) 

1 register (Rn2, £ n2 ^ 27) 
10 indirect (disp = 0, 1, IRO, I R1) 

1 1 indirect (disp = 0, 1, IRO, IR1) 



Encoding 
31 



24 23 

— I — I — I — I — I — I — I — I — 
10 111 



1615 







— I — I — r 
srcl 



87 

T — I — I — I — I — I — r 
src2 



Description 



Cycles 
Status Bits 



Mode Bit 



Example 



The src2 operand is subtracted from the srcl operand. The result is not 
loaded into any register, thus allowing for nondestructive compares. The 
srcl and src2 operands are assumed to be signed integers. Although this 
instruction has only two operands, it is designated as a three operand in- 
struction since operands are specified in the three operand format. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 



OVM Operation not affected by OVM. 

CMPI3 R7,R4 
Before Instruction: 



R7 = 03E8h = 1000 
R4 = 0898h = 2200 
LUF LVUFNZVC = O00OO0O 

After Instruction: 

R7 = 03E8h = 1000 
R4 = 0898h = 2200 
LUFLVUFNZVC^OOOOOOO 




11-63 



DBcond 



Decrement and Branch Conditionally (Standard) 



Syntax 
Operation 



Operands 



Encoding 
31 



DBcond <ARn>, <src> 

ARn - 1 -♦ ARn 
If cond is true and ARn >. : 
If src is in register addressing mode (Rn 0<n£27) 

src -» PC. 
If src is in PC-relative mode (label or address) 
displacement + PC + 1 -►PC. 
Else, continue. 

src conditional-branch addressing modes (B): 

register 

1 PC-relative 

ARn register (0 ^ n ^ 7) 



2423 1615 



87 



I I I I I I I I I 
register or displacement 



110 11 



ARn 



cond 




Description DBcond signifies a standard branch that executes in four cycles. The spe- 
cified auxiliary register is decremented and a branch is performed if the 
condition is true and the specified auxiliary register is greater than or equal 
to zero. 

The auxiliary register is treated as a 24-bit signed integer. The most-signi- 
ficant eight bits are unmodified by the decrement operation. The compar- 
ison of the auxiliary register uses only the 24 least-significant bits of the 
auxiliary register. Note that the branch condition does not depend on the 
auxiliary register decrement. 

If the src operand is expressed in register addressing mode, the contents of 
the specified register are loaded into the PC. If the src operand is expressed 
in PC-relative addressing mode, the assembler generates a displacement: 
displacement = label - (PC of branch instruction + 1). This integer is 
stored as a 1 6 bit signed integer in the 1 6 least significant bits of the branch 
instruction word. This displacement is added to the PC of the branch in- 
struction plus 1 to generate the new PC. 

The TMS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 11.2 for a list of condition mnemonics, encoding, 
and flags). 



Cycles 


4 




Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


IVIode Bit 


OVM 


Operation n 
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Decrement and Branch Conditionally (Standard) DBcond 

Example DBLT AR3,R2 

Before Instruction: 

PC = 5Fh 

AR3 = 12h 

R2 = 9Fh 

LUF LV UF N Z V C = 1 

After Instruction: 

PC = 9Fh 

AR3 = 11h 

R2 = 9Fh 

LUF LVUFNZVC = 0001 000 
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DBcondD Decrement and Branch Conditionally (Delayed) 



Syntax 
Operation 



Operands 



Encoding 
31 



DBcondD <ARn>, <src> 

ARn - 1 -► ARn 
If cond is true: 
If src is in register addressing mode (Rn 0^n<27) 

src -» PC 
If src is in PC-relative mode (label or address) 
displacement + PC + 3 -► PC. 
Else, continue. 

src conditional-branch addressing modes (B): 

register 

1 PC-relative 

ARn register (0 ^ n ^ 7) 



2423 1615 



87 



— I — I — I — I — r 

1 1 1 1 B 



— 1 — r- 
ARn 



-1 — I — I — I — r 
register or displacement 



1 I I r 
cond 



I I I r 




Description DBcondD signifies a delayed branch that allows the three instructions after 
the delayed branch to be fetched before the PC is modified. The effect is a 
single-cycle branch. The specified auxiliary register is decremented and a 
branch is performed if the condition is true and the specified auxiliary reg- 
ister greater than or equal to zero. 

The auxiliary register is treated as a 24-bit signed integer. The most-signi- 
ficant eight bits are unmodified by the decrement operation. The compar- 
ison of the auxiliary register uses only the 24 least-significant bits of the 
auxiliary register. Note that the branch condition does not depend on the 
auxiliary register decrement. 

If the src operand is expressed in register addressing mode, the contents of 
the specified register are loaded into the PC. If the src is expressed in 
PC-relative addressing, the assembler generates a displacement: displace- 
ment = label - (PC of branch instruction + 3). This displacement is added 
to the PC of the branch instruction plus 3 to generate the new PC. Note 
that bit 21 = 1 for a delayed branch. 

The TMS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 11.2 for a list of condition mnemonics, encoding, 
and flags). 

Cycles 1 



Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


IVIode Bit 


OVM 


Operation n 
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Decrement and Branch Conditionally (Delayed) DBcondD 

Example DBZD AR5,$ + 110h 

Before Instruction: 

PC = Oh 

AR5 = 67h 

LUF LVUFNZVC-00001 00 

After instruction: 

PC = llOh 
AR5 = 66h 
LUF LVUFNZVC = 00001 00 
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FIX 



Floating-Point to Integer Conversion 



Syntax 

Operation 

Operands 



FIX <src>.<dst> 

fix (src) -*■ dst 

src general addressing modes (G): 
register (Rn, < n < 7) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n ^ 27) 



Encoding 
31 



24 23 



1615 



87 



10 10 



-1 — I — I — I — I — I — I — I — I — I — I — I — I — I — r 
src 



dst 



Description 



Cycles 
Status Bits 




l\/lode Bit 



Example 



The floating-point operand src is converted to the nearest integer less than 
or equal to it in absolute value, and the result is loaded into the dst register. 
The src operand is assumed to be a floating-point number and the dst op- 
erand a signed integer. 

The exponent field of the result register (if it has one) is not modified. 

Integer overflow occurs when the floating-point number is too large to be 
represented as a 32-bit two's-compiement integer. In the case of integer 
overflow, the result will be saturated in the direction of overflow. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C Unaffected. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

OVM Operation not affected by OVM. 

FIX R1,R2 

Before Instruction: 

R1 = 0A282CCCCCh = -1.3454e+3 

R2 = 0h 

LUF LVUFNZVC = 0000000 

After Instruction: 

R1 = 0A282CCCCCh = -13454e+3 
R2 = 541h = 1345 
LUFLVUFNZVC = 0000000 
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Parallel FIX and STI 



FIXIISTI 



Syntax 



Operation 



Operands 



Encoding 
31 



FIX <src2>,<dst1> 
II STI <src3>,<dst2> 

f\x{src2) -* dsti 
1 1 src3 -► dst2 

src2 indirect (disp = 0, 1, IRO, IR1) 

dsti register (Rnl, £ n1 < 7) 

src3 register (Rn2, £ n2 ^ 7) 

dst2 indirect (disp = 0, 1,IR0, IR1) 



24 23 



1615 



— I — I — I — I — I — I — 
110 10 10 



dsti 



— I — I — 




-T — r- 
src3 



T 1 1 1 

dst2 



87 

-| — I — I — I — I — I — r 
src2 



Description 



Cycles 
Status Bits 



Mode Bit 



A floating-point to integer conversion is performed. All registers are read 
at the beginning and loaded at the end of the execute cycle. This means 
that if one of the parallel operations (STI) reads from a register, and the 
operation being performed in parallel (FIX) writes to the same register, then 
STI accepts as input the contents of the register before it is modified by FIX. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

Integer overflow occurs when the floating-point number is too large to be 
represented as a 32-bit two's-complement integer. In the case of integer 
overflow, the result will be saturated in the direction of overflow. 



1 

N 

Z 
V 

c 

UF 
LV 



1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 

1 if an integer overflow occurs, otherwise. 

Unaffected. 



1 if an integer overflow occurs, unchanged otherwise. 



LUF Unaffected. 

OVM Operation affected by OVM. 
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FIXIISTI Parallel FIX and STI 



Example FIX *++AR4 ( 1 ) , Rl 

I I STI R0,*AR2 

Before Instruction: 

AR4 = 8098A2h 

Rl = Oh 

RO = ODCh = 220 

AR2 = 80983Ch 

Data at 8098A3h = 733C000h = 1 .7950e+02 

Data at 80983Ch = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR4 = 8098A3h 

Rl = 0B3h = 179 

RO = ODCh = 220 

AR2 = 80983Ch 

Data at 8098A3h = 733C000h = 1 .79750e+02 

Data at 80983Ch = ODCh = 220 

LUF LVUFNZVC = 0000000 
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Integer to Floating-Point Conversion 



FLOAT 



Syntax 

Operation 

Operands 



FLOAT <src>.<dst> 

f I oat (5/-C) -► dst 

src general addressing modes (G): 
register (Rn, < n < 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n < 7) 



Encoding 



31 


24 23 






1615 


87 





1 1 



1 1 1 1 1 
10 11 


1 
G 


1 1 1 1 
dst 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
src 



Description 

Cycles 
Status Bits 



The integer operand src is converted to the floating-point value equal to it, 
and the result loaded into the dst register. The src operand is assumed to 
be a signed integer, and the dst operand a floating-point number. 



1 

N 

Z 
V 

c 

UF 
LV 

LUF Unaffected. 



1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 



Unaffected. 



Unaffected. 



IVIode Bit 



OVM Operation not affected by OVM. 
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FLOAT Integer to Floating-Point Conversion 

Example FLOAT *++AR2 ( 2 ) , R5 

Before Instruction: 

AR2 = 809800h 

R5 = 034C2000h = 1 .275781 25e+01 
Data at 809802h = OAEh = 1 74 
LUFLVUFNZVC = 0000000 

After Instruction: 

AR2 = 809802h 

R5 = 072E00000h = 1 .74e+02 

Data at 809802h = OAEh = 1 74 

LUFLVUFNZVC = 0000000 
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Parallel FLOAT and STF 



FLOATIISTF 



Syntax 



Operation 



Operands 



Encoding 
31 



FLOAT <src2><dst1> 
II STF <src3> <dst2> 

float(s/-c2) -► dsti 
1 1 src3 -* dst2 

src2 indirect (disp = 0, 1, IRO, IR1 ) 

dsti register (Rnl, < n1 ^7) 

src3 register (Rn2, ^ n2 ^ 7) 

dst2 register (disp = 0, 1, IRO, IR1) 



— I — I — I — I — I — I — ■ 
110 10 11 



24 23 

— I r 

dsti 



— I — I — 




— 1 — r 
src3 



1615 87 

1 — I — I — I — I — I — r 



dst2 



T 1 1 1 — I 1 r 

src2 



Description 



Cycles 
Status Bits 



Mode Bit 



Example 



An integer to floating-point conversion is performed. All registers are read 
at tiie beginning and loaded at the end of the execute cycle. This means that 
if one of the parallel operations (STF) reads from a register and the opera- 
tion being performed in parallel (FLOAT) writes to the same register, then 
STF accepts as input the contents of the register before it is modified by 
FLOAT. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 

C Unaffected. 

UP 

LV Unaffected. 

LUF Unaffected. 

OVM Operation not affected by OVM. 

FLOAT *+AR2 ( IRO ) , R6 
I I STF R7,*AR1 

Before Instruction: 

AR2 = 8098C5h 

IRO = 8h 

R6 - Oh 

R7 = 034C200000h = 1. 275781 25e+01 

AR1 = 809933h 

Data at 8098CDh = OAEh = 174 

Data at 809933h = Oh 

LUF LVUFNZVC = 0000000 
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FLOATIISTF Parallel FLOAT and STF 



After Instruction: 

AR2 = 8098C5h 

IRO = 8h 

R6 = 072E000000h = 1 .740e+02 

R7 = 034C200000h = 1 .275781 25e+01 

AR1 = 809933h 

Data at 8098CDh = OAEh = 174 

Data at 809933h = 034C2000h = 1. 275781 25e+01 

LUFLVUFNZVC = 0000000 
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Interrupt Ackno\A/ledge 



lACK 



Syntax 
Operation 

Operands 
Encoding 



lACK <src> 



Perform a dummy read ope ration with lACK = 0. 
At end of dummy read, set lACK to 1 . 

src general addressing modes (G): 
1 direct 
1 indirect 



31 


24 23 




1615 


87 





1 1 



1 1 1 1 1 
110 110 


1 
G 


1 1 1 1 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

src 



Description A dummy rea d op eration is performed with lACK - 0. At the end of the 
dummy read, lACK is set to 1 . This instruction can be used to generate an 
external interrupt acknowledge. If the address specified is off-chip, a read 
operation from that address is performed. The JACK signal and the address 
can then be used to signal interrupt acknowledge to external devices. The 
data read by the processor is unused. 



Cycles 



1 



Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


l\/lode Bit 


OVM 


Operation n 



Example 



lACK *AR5 
Before instruction: 



lACK = 1 

PC = 300h 

LUFLVUFNZVC-0000000 

After Instruction: 



JACK = 1 

PC = 301 h 

LUFLVUFNZVC^OOOOOOO 




11-75 



IDLE 



Idle Until Interrupt 



Syntax 


IDLE 


Operation 


1 - ST(GIE) 
Next PC -* PC 
Idle until Interrupt 


Operands 


None 


Encoding 




31 


24 23 



1615 



87 



— I — I — 




— I — I — I — I — I — 
110 



I I I I I I I I I I I I I I I I I I I I I I 

00000000000000000000000 



Description The global interrupt enable bit is set, the next PC value is loaded into the 
PC, and the CPU idles until an interrupt is received. When the interrupt is 
received, the contents of the PC are pushed on the active system stack. 



Cycles 




1 



Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


Mode Bit 


OVM 


Operation n 
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Load Floating-Point Exponent 



LDE 



Syntax 

Operation 

Operands 



LDE <src>,<dst> 

5/-c(exp) -► dst{exp) 

src general addressing modes (G): 
register (Rn, ^ n < 7) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, £ n < 7) 



Encoding 



31 


24 23 






1615 


87 








110 1 


G 


dst 


src 



Description The exponent field of the src operand is loaded into the exponent field of 
the dst register. No modification of the dst register mantissa field is made 
unless the value of the exponent loaded is the reserved value of the expo- 
nent for zero in the precision of the src operand. Then the mantissa field 
of the dst register is set to zero. The src and dst operands are assumed to 
be floating-point numbers. 

Cycles 1 



Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


Mode Bit 


OVM 


Operation n 



Example LDE R0,R5 

Before Instruction: 



RO = 0200056F30h = 4.00066337e+00 
R5 = 0A056FE332h = 1.06749648e+03 
LUF LVUFNZVC = 0000000 

After Instruction: 

RO = 0200056F30h = 4.00066337e+00 
R5 = 02056FE332h = 4.1 699081 4e+00 
LUFLVUFNZVC = 0000000 
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LDF 



Load Floating-Point 



Syntax 

Operation 

Operands 



LDF <src>,<dst> 
src -» dst 



src 



general addressing modes (G): 
register (Rn, ^ n :S 7) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, < n < 7) 



Encoding 
31 



24 23 



1615 



87 



000001110 



dst 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 




The src operand is loaded into the dst register. The dst and src operands 
are assumed to be floating-point numbers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 

C Unaffected. 

UF 

LV Unaffected. 

LUF Unaffected. 



OVM Operation not affected by OVM. 
LDF @9800h,R2 
Before Instruction: 



DP = 80h 

R2 = Oh 

Data at 809800h = 10C52A00h = 2.19254303e+00 

LUFLVUFNZVC = 0000000 

After Instruction: 

DP = BOh 

R2 = 010C52A000h = 2.19254303e+00 

Data at BOSBOOh = 10C52A00h = 2.19254303e+00 

LUF LVUFNZVC^OOOOOOO 
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Load Floating-Point Conditionally 



LDFcond 



Syntax 
Operation 



Operands 



LDFcond <src>,<dst> 

If cond is true: 

src -* dst. 
Else: 

dst is unchanged. 

src general addressing modes (G): 
register (Rn, ^ n ^ 7) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n £ 7) 



Encoding 



31 


24 23 






1615 


87 





1 1 1 
10 


1 1 1 1 
cond 


1 
G 


1 1 1 1 
dst 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
src 



Description If the condition is true, the src operand is loaded into the dst register. 
Otherwise, the dst register is unchanged. The dst and src operands are 
assumed to be floating-point numbers. 

The TMS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 11.2 for a list of condition mnemonics, encoding, 
and flags). Note that an LDFU (load floating-point unconditionally) in- 
struction is useful for loading R0-R7 without affecting condition flags. 

Cycles 1 



Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


Mode Bit 


OVM 


Operation n 



Example LDFZ R3,R5 

Before Instruction: 

R3 = 2CFF2CD500h = 1.77055560e+13 
R5 = 5F0000003Eh = 3.961 40824e+28 
LUF LV UF N Z V C = 1 

After Instruction: 

R3 = 2CFF2CD500h = 1.77055560e+13 
R5 = 2CFF2CD500h = 1.77055560e+1 3 
LUF LV UF N Z V C = 1 
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LDFI 



Load Floating-Point, Interlocked 



Syntax 
Operation 

Operands 



Encoding 
31 



LDFI <src>,<dst> 

Signal interlocked operation. 
src -* dst 

src general addressing modes (G): 

1 direct 

1 indirect 

dst register (Rn, ^ n :S 7) 



24 23 



1615 



87 



— I — I — I — I — I — I — I — I — 
1111 



-I — I — I — r 
dst 



T — I — I — r 



T — I — I — I — I — I — r 
src 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 




The src operand is loaded into the dst register. An interlocked operation is 
signaled over XFO and XF1 . The src and dst operands are assumed to be 
floating-point numbers. Note that only direct and indirect modes are al- 
lowed. Refer to Section 7.3 for detailed description. 

1 if XF1 = (see Section 7.3) 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 

C Unaffected. 

UP 

LV Unaffected. 

LUF Unaffected. 

OVM Operation not affected by OVM. 

LDFI *+AR2,R7 

Before Instruction: 



AR2 = 8098F1 h 

R7 = Oh 

Data at 8098F2h = 584C000h = -6.281 25e+01 

LUFLVUFNZVC = 0000000 

After Instruction: 



AR2 = 8098F1 h 

R7 = 0584C00000h = -6.281 25e+01 

Data at 8098F2h = 584C000h = -6.281 25e+01 

LUF LV UF N Z V C = 1 
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Parallel LDF and LDF 



LDFIILDF 



Syntax 



Operation 



Operands 



Encoding 



LDF <src2>,<dst2> 
II LDF <src1>,<dst1> 

src2 -* dst2 
\\src1 -^ dsti 

srcl indirect (disp = 0, 1, IRO, IR1) 

dsti register (Rnl, ^ n1 ^7) 

src2 indirect (disp = 0, 1, IRO, IR1) 

dst2 register (Rn2, ^ n2 :S 7) 



31 




24 23 




1615 




87 







1 1 


1 1 1 1 
10 


1 1 
dst2 


dsti 





I 1 1 1 1 1 1 
srcl 


"T 1 1 1 1 1 1 

src2 



Description 

Cycles 
Status Bits 



Mode Bit 



Example 



Two floating-point loads are performed in parallel. If the LDFs load the 
same register, the assembler issues a warning. The result is that of LDF 
<src2>,<dst2>. 



1 

N 

Z 
V 

c 

UF 
LV 
LUF 



Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 



OVM Operation not affected by OVM. 

LDF * — AR1(IR0),R7 
I I LDF *AR7++(1) ,R3 

Before Instruction: 

AR1 = 80985Fh 

IRO = 8h 

R7 = Oh 

AR7 = 80988Ah 

R3 = Oh 

Data at 809857h = 70C8000h = 1.4050e+02 

Data at 80988Ah = 57B4000h = 6.281 250e+01 

LUF LVUFNZVC = 0000000 

After Instruction: 




AR1 = 809857h 

IRO = 8h 

R7 - 070C800000h = 1 .4050e+02 

AR7 = 80988Bh 

R3 = 057B400000h = 6.281 250e+01 

Data at 809857h = 70C8000h = 1.4050e+02 

Data at 80988Ah = 57B4000h = 6.281 250e+01 

LUFLVUFNZVC = 0000000 
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LDFilSTF 



Parallel LDF and STF 



Syntax 

Operation 

Operands 

Encoding 



LDF <src2><dst1> 
STF <src3> ,<dst2> 

src2 -» dsti 
src3 -» dst2 



src2 indirect (disp = 0, 1, IRO, IR1) 
dsti register (Rnl, ^ n1 ^ 7) 
src3 register (Rn2, £ n2 ^ 7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 



31 




24 23 




1615 




87 







1 
1 1 


1 1 1 1 
110 


1 1 
dsti 


1 1 



1 1 
src3 


1 1 1 1 1 1 1 
dst2 


1 1 1 1 1 1 1 
src2 



Description A floating-point load and a floating-point store are performed in parallel. 



Cycles 
Status Bits 



Mode Bit 
Example 




If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 



1 

N 

Z 
V 

c 

UF 
LV 
LUF 



Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 



OVM Operation not affected by OVM. 

LDF *AR2~(1) ,R1 
I I STF R3,*AR4++(IR1) 

Before Instruction: 

AR2 = 8098E7h 

R1 =0h 

R3 = 057B400000h = 6.281 25e+01 

AR4 = 809900h 

IR1 = lOh 

Data at 8098E7h = 70C8000h = 1.4050e+02 

Data at 809900h = Oh 

LUFLVUFNZVC = 0000000 
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Parallel LDF and STF LDF||STF 



After Instruction: 

AR2 = 8098E6h 

R1 = 070C800000h = 1.4050e+02 

R3 = 057B400000h = 6.281 25e+01 

AR4 = 80991 Oh 

IR1 = lOh 

Data at8098E7h = 70C8000h = 1.4050e+02 

Data at 809900h = 57B4000h = 6.281 25e+01 

LUFLVUFNZVC = 0000000 
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LDI 



Load Integer 



Syntax 

Operation 

Operands 



LDI <src>,<cfst> 

src -» dst 

src general addressing modes (G): 
register (Rn, £ n ^ 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n ^ 27) 



Encoding 



31 


24 23 






1615 


87 








1 1 J 1 1 
10 


G 


1 1 1 1 
dst 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
src 



Description The src operand is loaded into the dst register. The dst and src operands 
are assumed to be signed integers. An alternate form of LDI, LDP, is used 
to load the data page pointer register (DP), or any other register with the 
eight MSBs of a relocatable address. See Section 1 1 .3.2. 

Cycles 1 



Status Bits 


N 


1 if a negative result is generated, otherwise 




Z 


1 if a zero result is generated, otherwise. 




V 







c 


Unaffected. 




UF 







LV 


Unaffected. 




LUF 


Unaffected. 


Mode Bit 


OVM 


Operation not affected by OVM. 




Example LDI *-ARl(IR0),R5 

Before Instruction: 

AR1 = 2Ch 

IRO = 5h 

R5 = 3C5h = 965 

Data at 27h = 26h = 38 

LUF LV UF N Z V C = 

After Instruction: 







AR1 = 2Ch 

IRO = 5h 

R5 = 26h = 38 

Data at 27h = 26h = 38 

LUFLVUFNZVC = 0000000 
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Load Integer Conditionally 



IDlcond 



Syntax 
Operation 



Operands 



LD\cond <src>,<dst> 

If cond is true: 

src -*■ dst, 
Else: 

dst is unchanged. 

src general addressing modes (G): 
register (Rn, ^ n ^ 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, :S n ^ 27) 



Encoding 
31 



24 23 



1615 



87 



10 1 



cond 



dst 



Description If the condition is true, the src operand is loaded into the dst register. 
Otherwise, the dst register is unchanged. The dst and src operands are as- 
sumed to be signed integers. 

The TMS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 11.2 for a list of condition mnemonics, encoding, 
and flags). Note that an LDIU (load integer unconditionally) instruction is 
useful for loading R0-R7 without affecting the condition flags. 



Cycles 


1 


Status Bits 


N Unaffected. 




Z Unaffected. 




V Unaffected. 




C Unaffected. 




UF Unaffected. 




LV Unaffected. 




LUF Unaffected. 


Mode Bit 


OVM Operation not a 


Example 


LDIZ R4,R6 




Before Instruction: 




R4 = 027Ch = 636 

R6 = 0FE2h = 4,066 

LUFLVUFNZVC^OOOOOOO 

After Instruction: 

R4 = 027Ch = 636 
R6 = 0FE2h = 4,066 
LUFLVUFNZVC = 0000000 
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LDII 



Load Integer, Interlocked 



Syntax 
Operation 

Operands 



Encoding 
31 



LDII <src>,<dst> 

Signal interlocked operation. 
src -* dst 

src general addressing modes (G): 

1 direct 

1 indirect 

dst register (Rn, ^ n ^ 27) 



24 23 



1615 



87 



I I I I I I I I 
1 1 



-I — I — I — r 
dst 



I I I I I I I I I I I I I I I 
src 



Description The src operand is loaded into the dst register. An interlocked operation is 
signaled over XFO and XF1 . The src and dst operands are assumed to be 
signed integers. Note that only the direct and indirect modes are allowed. 
Refer to Section 7.3 for detailed description. 

Cycles 1 if XF = (see Section 7.3) 



Status Bits 


N 


1 if a negative result is generated,© otherwise 




Z 


1 if a zero result is generated, otherwise. 




V 







c 


Unaffected. 




UF 







LV 


Unaffected 




LUF 


Unaffected. 


Mode Bit 


OVM 


Operation not affected by OVM. 



Example 




LDII (a985Fh,R3 

Before Instruction: 

DP = 80 

R3 = Oh 

Data at 80985Fh = ODCh 

LUFLVUFNZVC = 0000000 

After Instruction: 

DP = 80 

R3 = ODCh 

Data at 80985Fh = ODCh 

LUFLVUFNZVC = 0000000 
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Parallel LDI and LDI 



LDIIILDI 



Syntax 

Operation 

Operands 

Encoding 



LDI <src2> ,<dst2> 
II LDI <src1>,<dst1> 

src2 -* dst2 
Wsrcl -^ dsti 



:7 indirect (disp = 0, 1, IRO, IR1) 
t7 register (Rnl, < n1 ^7) 

-9 inHirort Mien = n 1 IRO IRI^ 



src'i 

dsti regioi^. v--" ■» - ^ ■■ ■ ^ ■ , 

src2 indirect (disp = 0, 1, IRO, IR1) 

dst2 register (Rn2, ^ n2 ^ 7) 



31 




24 23 




1615 




87 







1 
1 1 


1 1 1 1 
11 


1 1 
dst2 


1 1 
dsti 


1 1 



1 1 1 1 1 1 1 
srcl 


1 J 1 1 1 1 1 
src2 



Description 



Cycles 
Status Bits 



Mode Bit 



Example 



Two integer loads are performed in parallel. A warning is issued by the 
assembler if the LDIs load the same register. The result is that of LDI 
<src2>,<dst2>. 



1 

N 

Z 
V 

c 

UF 
LV 
LUF 



Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 



OVM Operation not affected by OVM. 



I I 



LDI 
LDI 



*-ARl(l) ,R7 
*AR7++(IR0) ,R1 



Before Instruction: 



AR1 


= 809826h 








R7 = 


Oh 








AR7 


- 8098C8h 








IRO = 


= lOh 








R1 = 


Oh 








Data at 809825h = 


OFAh 


= 250 




Data at 8098C8h = 


2EEh 


= 750 




LUF 


LV UF N Z 


V C = 








Afte 


r Instruction 










AR1 = 809826h 

R7 = OFAh = 250 

AR7 = 8098D8h 

IRO - lOh 

R1 = 02EEh = 750 

Data at 809825h = OFAh = 250 

Data at 8098C8h = 2EEh = 750 

LUFLVUFNZVC = 0000000 
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LDIIISTI 



Parallel LDI and STI 



Syntax 



Operation 



Operands 



Encoding 



LDI <src2><dst1> 
II STI <src3><dst2> 

src2 -^ dsti 
\\src3 -> dst2 

src2 indirect (disp = 0, 1, IRO, IR1) 

dst^ register (Rnl, ^ n1 < 7) 

src3 register (Rn2, ^ n2 <7) 

dst2 indirect (disp = 0, 1, IRO, IR1) 



31 




24 23 




1615 




87 







1 
1 1 


1 1 1 1 
110 1 


dsti 


1 1 



1 1 
src3 


1 1 1 1 1 1 1 
dst2 


1 1 1 1 1 1 1 
src2 



Description An integer load and an integer store are performed in parallel. 



Cycles 
Status Bits 



If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 



1 

N 

Z 
V 

c 

UF 
LV 
LUF 



Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 



l\/lode Bit 



OVM Operation not affected by OVM. 




11-88 



Parallel LDI and STI LDIMSTI 



Example LDI *-ARl ( 1 ) , R2 

I I STI R7,*AR5++(IR0) 

Before Instruction: 



AR1 = 8098E7h 

R2 = Oh 

R7 = 35h = 53 

AR5 = 80982Ch 

IRO - 8h 

Data at 8098E6h = ODCh = 220 

Data at 80982Ch = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR1 = 8098E7h 

R2 = ODCh = 220 

R7 - 35h = 53 

AR5 = 809834h 

IRO = 8h 

Data at 8098E6h = ODCh = 220 

Data at 80982Ch = 35h = 53 

LUF LVUFNZVC-0000000 
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LDM 



Load Floating-Point Mantissa 



Syntax 

Operation 

Operands 



LDM <src>,<dst> 

src(man) -► dst{man) 

src general addressing modes (G): 
register (Rn, ^ n ^ 7) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n ^ 7) 



Encoding 
31 



24 23 



1615 



87 



10 10 



dst 



Description The mantissa field of the src operand is loaded into the mantissa field of the 
dst register. The dst exponent field is not modified. The src and dst op- 
erands are assumed to be floating-point numbers. If immediate addressing 
mode is used, bits 15 - 1 2 of the instruction word are forced to by the 
assembler. 

Cycles 1 



Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


Mode Bit 


OVM 


Operation n 



Example 




LDM 156.75,R2 (1 56.75 = 071 CCOOOOOh) 
Before Instruction: 

R2 = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

R2 = 001 CCOOOOOh = 1.22460938e+00 
LUFLVUFNZVC = 0000000 
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Logical Shift 



LSH 



Syntax 
Operation 



Operands 



LSH <count> ,<dst> 

If count > 0: 

dst << count -* dst 
Else: 

dst >> \count\ -* dst 

src general addressing modes (G): 
register (Rn, ^ n £ 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n ^ 27) 



Encoding 
31 



24 23 



1615 



87 



— : — I — I — I — I — 1 — I — I — 
10 10 11 



dst 



count 



Description 



Cycles 
Status Bits 



The seven least-significant bits of the count operand are used to generate 
the two's-complement shift count. If the count operand is greater than zero, 
the dst operand is left- shifted by the value of the count operand. Low- 
order bits shifted in are zero-filled, and high-order bits are shifted out 
through the C (carry) bit. 

Logical left-shift: 

C^ dst^ 

If the count operand is less than zero, the dst is right-shifted by the absolute 
value of the count operand. The high-order bits of the dst operand are 
zero-filled as shifted to the right. Low-order bits are shifted out through the 
C (carry) bit. 

Logical right-shift: 
Q ^ dst-^ C 

If the count operand is 0, no shift is performed and the C (carry) bit is set 
to 0. The count operand is assumed to be a signed integer and the dst 
operand is assumed to be an unsigned integer. 



1 

N 

Z 
V 

c 

UF 
LV 

LUF Unaffected. 




MSB of the output. 

1 if a zero output is generated, otherwise. 



Set to the value of the last bit shifted out. for a shift count of 0. 

Unaffected if dst is not R0-R7. 



Unaffected. 



IVIode Bit 



OVM Operation not affected by OVM. 
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LSH 



Logical Shift 



Example LSH R4,R7 

Before Instruction: 

R4 = 018h = 24 

R7 = 02ACh 

LUFLVUFNZVC = 0000000 

After Instruction: 

R4 = 018h = 24 

R7 = OACOOOOOOh 

LUF LVUFNZVC^OOOI 01 

Example LSH *-AR5 (IRl) ,R5 

Before Instruction: 

AR5 = 809908h 

IRO = 4h 

R5 = 0012C00000h 

Data at 809904h = 0FFFFFFF4h = -12 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR5 = 809908h 

IRO = 4h 

R5 = 000001 2C00h 

Data at 809904h = 0FFFFFFF4h = -12 

LUF LVUFNZVC = 0000000 
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Logical Shift, 3-Operand 



LSH3 



Syntax 
Operation 



Operands 



LSH3 <count>.<src>,<dst> 

If count > 0: 

src < < count -*■ dst 
Else: 

src >> \count\ -* dst 

src three-operand addressing modes (T): 
register (Rnl, ^ n < 27) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 

1 register (Rnl, < n1 ^ 26) 
11 indirect (disp = 0, 1, IRO, IR1) 

count three-operand addressing modes (T): 
register (Rn2, ^ n2 £ 27) 

1 register (Rn2, < n2 < 27) 
10 indirect (disp = 0, 1, IRO, IR1) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, ^ n ^ 27) 



Encoding 



31 


24 23 






1615 




87 







1 


1 


T 


. dst 


src 


1 1 1 1 1 1 1 
count 



Description The seven least-significant bits of the count operand are used to generate 
the two's-complement shift count. 

If the count operand is greater than zero, the dst operand is left-shifted by 
the value of the count operand. Low-order bits shifted in are zero-filled, 
and high-order bits are shifted out through the C (carry) bit. 

Logical left-shift: 
C *- S/-C *- 

If the count operand is less than zero, the src operand is right-shifted by the 
absolute value of the count operand. The high-order bits of the dst operand 
are zero-filled as shifted to the right. Low-order bits are shifted out through 
the C (carry) bit. 

Logical right-shift: 
-* src -* C 

If the count operand is 0, no shift is performed and the C (carry) bit is set 
to 0. The count operand is assumed to be a signed integer. The src and 
dst operands are assumed to be unsigned integers. 

Cycles 1 




11-93 



LSH3 



Logical Shift, 3-Operand 



Status Bits 


N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 






C Set to the value of the last bit shifted out. for a 


shift count of 0. 




Unaffected if dst is not R0-R7. 






UF 






LV Unaffected. 






LUF Unaffected. 




Mode Bit 


OVM Operation not affected by OVM. 




Example 


LSH3 R4,R7,R2 
Before Instruction: 





Example 




R4 = 018h = 24 

R7 = 02ACh 

R2 = Oh 

LUF LVUFNZVC = 0000000 

After Instruction: 



1 



R4 = 018h - 24 

R7 = 02ACh 

R2 = OACOOOOOOh 

LUF LVUFNZVC = 0001 

LSH3 *-AR4(IRl)R5,R3 
Before Instruction: 

AR4 = 809908h 

IR1 = 4h 

R5 = 012C00000h 

R3 = Oh 

Data at 809904h = 0FFFFFFF4h - -12 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR4 = 809908h 

IR1 = 4h 

R5 = 012C00000h 

R3 = 000001 2C00h 

Data at 809904h = 0FFFFFFF4h = -12 

LUF LVUFNZVC-0000000 
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Parallel LSH3 and STI 



LSH3IISTI 



Syntax 
Operation 

Operands 



Encoding 
31 



LSH3 <count>,<src2>,<dst1> 
II STI <src3> ,<dst2> 

If count >_ 0: 

src2 << count -^ dsti 
Else: 

src2 >> \count\ -^ dsti 
1 1 src3 -^ dst2 

count register (Rnl, ^ n1 ^7) 
srcl indirect (disp = 0, 1, IRO, IR1) 
dsti register (Rn3, < n3 < 7) 
src2 register (Rn4, < n4 <7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 



— I — I — I — I — 
110 1110 



24 23 

— I — r 
dsti 



1615 



87 



I I 
count 



src3 



T I I I I T 

dst2 



I I I I I r 
src2 



Description 



The seven least-significant bits of the count operand are used to generate 
the two's-complement shift count. 

If the count operand is greater than zero, the dst operand is left-shifted by 
the value of the count operand. Low- order bits shifted in are zero-filled 
and high-order bits are shifted out through the C (carry) bit. 

Logical left-shift: 

C ♦- dst2 ^ 

If the count operand is less than zero, the dst operand is right-shifted by the 
absolute value of the count operand. The high-order bits of the dst operand 
are zero filled as shifted to the right. Low-order bits are shifted out through 
the C (carry bit). 

Logical right-shift: 
-* dst2 -* C 

If the count operand is 0, no shift is performed and the carry bit is set to 0. 

The count operand is assumed to be a 7-bit signed integer and the src2 and 
dsti operands are assumed to be unsigned integers. All registers are read 
at the beginning and loaded at the end of the execute cycle. This means 
that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (LSH3) writes to the same register, 
then STI accepts as input the contents of the register before it is modified 
by the LSH3. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 



Cycles 
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LSH3IISTI 




Parallel LSH3 and STI 


Status Bits 
Mode Bit 


N 

Z 
V 

c 

UF 
LV 
LUF 

OVM 


MSB of the output. 

1 if a zero output is generated, otherwise. 



Set to the value of the last bit shifted out. for a shift count of 0. 



Unaffected. 

Unaffected. 

Operation not affected by OVM. 



Example LSH3 R2 , *++AR3 ( 1) ,R0 

I I STI R4,*-AR3 



Before Instruction: 

R2 = 18h = 24 

AR3 = 8098C2h 

RO = Oh 

R4 = ODCh = 220 

AR3 = 8098A3h 

Data at 8098C3h = OACh 

Data at 8098A2h = Oh 

LUF LV UF N Z V C = 

After Instruction: 








R2 = 18h = 24 

AR3 = 8098C3h 

RO = OACOOOOOOh 

R4 = ODCh = 220 

AR3 = 8098A3h 

Data at 8098C3h = OACh 

Data at 8098A2h = ODCh = 220 

LUF LVUFNZVC = 0001 000 
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Parallel LSH3 and STI LSH3IISTI 



Example LSH3 R7,*AR2 — (1),R2 

I I STI R0,*+AR0(1) 

Before Instruction: 

R7 = 0FFFFFFF4h =-12 

AR2 = 809863h 

R2 = Oh 

RO = 12Ch = 300 

ARO = 8098B7h 

Data at 809863h = 2C000000h 

Data at 8098B8h = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

R7 = 0FFFFFFF4h =-12 

AR2 = 809862h 

R2 = 2C000h 

RO = 12Ch = 300 

ARO = 8098B7h 

Data at 809863h = 2C000000h 

Data at 8098B8h = 12Ch = 300 

LUFLVUFNZVC = 0000000 
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MPYF 



Multipy Floating-Point 



Syntax 

Operation 

Operands 



MPYF <src>,<dst> 

dst X src -* dst 

src general addressing modes (G): 
register (Rn, ^ n ^ 7) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n ^ 7) 



Encoding 



31 


24 23 






1615 


87 








10 10 


G 


dst 


src 



Description 



Cycles 
Status Bits 



IVIode Bit 
Example 




The product of the dst and src operands is loaded into the dst register. The 
src operand is assumed to be a single-precision floating-point number, and 
the (ysf operand is an extended-precision floating-point number. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point is overflow occurs, otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 



OVM Operation not affected by OVM. 
MPYF R0,R2 
Before Instruction: 



RO - 070C800000h = 
R2 = 034C200000h = 
LUF LV UF N Z V 

After Instruction: 



1.4050e+02 
1. 275781 25e+01 
C = 0000000 



RO = 070C800000h = 1 .4050e+02 

R2 = 0A600F2000h = 1 .79247266e+03 

LUFLVUFNZVC = 0000000 
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Multiply Floating-Point, 3-Operand 



MPYF3 



Syntax 

Operation 

Operands 



Encoding 



MPYFS <src2><src1>,<dst> 

srcl X src2 -* dst 

srcl three-operand addressing modes (T): 
register (Rnl, £ n1 ^7) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 

1 register (Rnl, <, x^^ <,!) 
11 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, ^ n2 < 7) 

1 register (Rn2, ^ n2 ^ 7) 
10 indirect (disp = 0, 1, IRO, I R1) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, < n ^ 7) 



31 


24 23 






1615 




87 







1 1 
1 


1 1 1 1 1 
1 1 


1 

T 


1 1 1 1 
dst 


1 1 1 1 1 1 
srcl 


1 1 1 1 1 1 1 
src2 



Description 



Cycles 
Status Bits 



The product of the dsti and src2 operands is loaded into the dst register. 
The src7 and src2 operands are assumed to be single-precision floating- 
point numbers, and the dst operand is an extended-precision floating-point 
number. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 



Mode Bit 



OVM Operation not affected by OVM. 
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MPYF3 



Multiply Floating-Point, 3-Operand 



Example 



Example 




MPYF3 R0,R7,R1 
Before Instruction: 

RO = 057B400000h = 6.281 250e+01 

R7 = 0733C00000h = 1 .79750e+02 

R1 = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 



RO = 057B400000h = 6.281 250e+01 
R7 = 0733C00000h = 1.79750e+02 
R1 = 0D306A3000h = 1.12905469e+04 
LUFLVUFNZVC = 0000000 

MPYF3 *+AR2(IR0) ,R7,R2 

or 

MPYF3 R7,*+AR2(IR0) ,R2 

Before Instruction: 



AR2 = 809800h 

IRO = 12Ah 

R7 = 057B400000h = 6.281 250e+01 

R2 = Oh 

Data at 80992Ah = 70C8000h = 1 .4050e+02 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR2 = 809800h 

IRO = 12Ah 

R7 = 057B400000h = 6.281 250e+01 

R2 = 0D09E4A000h = 8.8251 5625e+03 

Data at 80992Ah = 70C8000h = 1.4050e+02 

LUFLVUFNZVC = 0000000 
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Parallel MPYF3 and ADDF3 



MPYF3IIADDF3 



Syntax 
Operation 

Operands 



MPYF3 <srcA>,<srcB>,<dst1> 
IIADDF3 <srcC><srcD><dst2> 



srcA X srcB 
1 1 srcC + srcD 



dsti 
dst2 



srcA 
srcB 
srcC 
srcD 



Any two indirect (disp = 0,1 ,IR0,IR1 ) 
Any two register (0 < ARn <, 7) 



dsti 



dst2 



srcl 
src2 
srcS 
src4 



register {d1): 

= RO 

1 = R1 

register {d2): 

= R2 

1 = R3 

register (Rn, ^ n < 7) 

register (Rn, < n < 7) 

indirect (disp = 0, 1, IRO, IRI ) 

indirect (disp = 0, 1, IRO, IRI) 

parallel addressing modes (0 < P ^ 3) 
OPERATION 











00 
01 
10 
11 




srcS X src4, srcl + src2 
src3 X srcl , src4 + src2 
srcl X src2, srcS + src4 
src3 X srcl, src2 + src4 










Encoding 














31 


24 23 




1615 


87 







1 


1 1 1 



P 


d1 


d2 


1 1 
srcl 


src2 


1 1 1 1 1 1 1 
src3 


1 1 1 1 1 1 1 
src4 



Description A floating-point multiplication and a floating-point addition are performed 
in parallel. All registers are read at the beginning and loaded at the end of 
the execute cycle. This means that if one of the parallel operations (MPYF3) 
reads from a register and the operation being performed in parallel 
(ADDF3) writes to the same register, then MPYF3 accepts as input the 
contents of the register before it is modified by the ADDF3. 

Any combination of addressing modes may be coded for the four possible 
source operands as long as the two are coded as indirect and two are reg- 
ister. The assignment of the source operands srcA-srcD to the src1-src4 
fields varies depending on the combination of addressing modes used, and 
the P field is encoded accordingly. The assembler may, when not signif- 
icant, change the order of operands in commutative operations, in order to 
simplify processing. 




11-101 



MPYF3IIADDF3 



Parallel MPYF3 and ADDF3 



Cycles 
Status Bits 



Mode Bit 
Example 




If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

1 

N 

Z 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

OVM Operation not affected by OVM. 

MPYF3 *AR5++(1),* — AR1(IR0),R0 
I I ADDF3 R5,R7,R3 

Before Instruction: 

AR5 = 8098C5h 

AR1 = 8098A8h 

IRO = 4h 

RO = Oh 

R5 = 0733CO00O0h = 1.79750e+02 

R7 = 070C800000h = 1.4050e+02 

R3 = Oh 

Data at 8098C5h = 34C0000h - 1.2750e+01 

Data at 8098A4h = IllOOOOh = 2.2500e+00 

LUF LVUFNZVC = 0000000 

After Instruction: 

AR5 = 8098C6h 

AR1 = 8098A4h 

IRO = 4h 

RO = 04671 SOOOOh = 2.888671 88e+01 

R5 = 0733C00000h = 1.79750e+02 

R7 = 070C800000h = 1.4050e+02 

R3 = 0820200000h = 3.20250e+02 

Data at 8098C5h = 34C0000h = 1 .2750e+01 

Data at8098A4h = IllOOOOh = 2.2500e+00 

LUFLVUFNZVC = 0000000 



11-102 



Parallel MPYF3 and STF 



MPYF3IISTF 



Syntax 



Operation 



Operands 



Encoding 
31 



MPYF3 <src2><src1>,<dst1> 
STF <src3> .<dst2> 



srcl 

\\src3 



src2- 
dst2 



dsti 



srcl register (Rnl, < n1 < 7) 

src2 indirect (disp = 0, 1, IRO, IR1 ) 

dsti register (Rn3, < n3 < 7) 

src3 register (Rn4, :S n4 <7) 

dst2 indirect (disp = 0, 1, IRO, IR1) 



24 23 

— I — r 
dsti 



1615 



87 



-1 — r 
srcl 



— 1 — r- 
src3 



— I — I — I — r 
src2 



T 1 1 I 

110 1111 



dst2 



Description 



Cycles 
Status Bits 



Mode Bit 



A floating-point multiplication and a floating-point store are performed in 
parallel. All registers are read at the beginning and loaded at the end of the 
execute cycle. This means that if one of the parallel operations (MPYF3) 
reads from a register and the operation being performed in parallel (STF) 
writes to the same register, then MPYF3 accepts as input the contents of 
the register before it is modified by the STF. 

If src2 and dst2 point to the same location, then src2 is read before the write 
to dst2. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

UP 1 if a floating-point underflow occurs, otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 



OVM Operation not affected by OVM. 
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MPYF3IISTF Parallel MPYF3 and STF 



Example MPYF3 *-AR2 ( 1) ,R7 ,R0 

I I STF R3,*AR0 — (IRO) 

Before Instruction: 

AR2 = 80982Bh 

R7 = 057B400000h = 6.281 250e+01 

RO = Oh 

R3 = 086B280000h = 4.7031 250e+02 

ARO = 809860h 

IRO = 8h 

Data at80982Ah = 70C8000h = 1.4050e+02 

Data at 809860h = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR2 = 80982Bh 

R7 = 057B400000h = 6.281 250e+01 

RO = 0D09E4A000h = 8.8251 5625e+03 

R3 = 086B280000h = 4.7031 250e+02 

ARO = 809858h 

IRO = 8h 

Data at 80982Ah = 70C8000h = 1.4050e+02 

Data at 809860h = 86B280000h = 4.7031 250e+02 

LUFLVUFNZVC = 0000000 
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Parallel MPYF3 and SUBF3 



MPYF3IISUBF3 



Syntax 
Operation 

Operands 



MPYF3 <srcA>,<srcB>,<dst1> 
SUBF3 <srcC>,<srcD>,<dst2> 



srcA 
srcD 



srcA 
srcB 
srcC 
srcD 



t srcB 
srcC - 



*■ dsti 
dst2 



dsti 



dst2 



srcl 
src2 
src3 
src4 



Encoding 



Any two indirect (disp ^ 0,1,iR0,IR1) 
Any two register (0 < ARn <, 7) 



register {d1): 

= RO 

1 = R1 

register {d2)\ 

= R2 

1 = R3 

register (Rn, ^ n ^ 7) 

register (Rn, ^ n ^ 7) 

indirect (disp = 0, 1, IRO, IR1 ) 

indirect (disp = 0, 1, IRO, IR1) 

parallel addressing modes (0 < P < 3) 
OPERATION 

00 src3 X src4, srcl - src2 

01 src3 X srcl , src4 - src2 

10 srcl \ src2, srcS - src4 

11 srcS X srcl , src2 - src4 



31 




24 23 






1615 87 


1 
1 


1 1 1 
1 


1 
P 


d1 


d2 


1 1 
srcl 


1 — I 
src2 


1 1 1 1 1 1 1 
src3 


src4 
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MPYF3liSUBF3 



Parallel MPYFS and SUBF3 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 




A floating-point multiplication and a floating-point subtraction are per- 
formed in parallel. All registers are read at the beginning and loaded at the 
end of the execute cycle. This means that if one of the parallel operations 
(MPYF3) reads from a register, and the operation being performed in par- 
allel (SUBF3) writes to the same register, then MPYF3 accepts as input the 
contents of the register before it is modified by the SUBF3. 

Any combination of addressing modes may be coded for the four possible 
source operands as long as the two are coded as indirect and two are reg- 
ister. The assignment of the source operands srcA-srcD to the srcl -src4 
fields varies depending on the combination of addressing modes used, and 
the P field is encoded accordingly. The assembler may, when not signif- 
icant, change the order of operands in commutative operations, in order to 
simplify processing. 

1 

N 

Z 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

UP 1 if a floating-point underflow occurs, otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 



OVM Operation not affected by OVM. 



MPYF3 
I I SUBF3 
or 

MPYF3 
I I SUBF3 



R5,*++AR7(IR1) ,R0 
R7,*AR3 — (1) ,R2 

R5 , *++AR7 ( IRl ) , R5 , RO 
R7,*AR3— (1) ,R2 



Before Instruction: 

R5 = 034C000000h = 1.2750e+01 

AR7 = 809904h 

IRl = 8h 

RO = Oh 

R7 = 0733C00000h = 1.79750e+02 

AR3 - 8098B2h 

R2 = Oh 

Data at 80990Ch = IllOOOOh = 2.250e+00 

Data at 8098B2h = 70C8000h = 1.4050e+02 

LUFLVUFNZVC = 0000000 
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Parallel MPYF3 and SUBF3 MPYF3||SUBF3 



After Instruction: 

R5 = 034C000000h = 1.2750e+01 

AR7 = 80990Ch 

IR1 = 8h 

RO = 04671 SOOOOh = 2.888671 88e+01 

R7 = 0733C00000h = 1.79750e+02 

AR3 = 8098B1h 

R2 = 05E3000000h = -3.9250e+01 

Data at 80990Ch = 111 OOOOh = 2.250e+00 

Data at 8098B2h = 70C8000h = 1.4050e+02 

LUFLVUFNZVC = 0000000 
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MPYI 



Multiply Integer 



Syntax 

Operation 

Operands 



MPYI <src>,<dst> 

dst X src -* dst 

src general addressing modes (G): 
register (Rn, < n < 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n ^ 27) 



Encoding 
31 



24 23 



1615 



87 



— I — I — \ — I — I — I — I — I — 
10 10 1 



T — I — I — r 
dst 



T — I — I — I — I — I — I — I — I — I — I — I — I — r 
src 



Description 



Cycles 
Status Bits 



Mode Bit 



Example 




The product of the dst and src operands is loaded into the dst register. The 
src and dst operands when read are assumed to be 24-bit signed integers. 
The result is assumed to be a 48-bit signed integer. The output to the dst 
register is the 32 least-significant bits of the result. 

Integer overflow occurs when any of the most-significant 16 bits of the 
48-bit result differs from the most-significant bit of the 32-bit output value. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C Unaffected. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unchanged. 



OVM Operation affected by OVM. 

MPYI R1,R5 
Before Instruction: 



R1 = 000033C251 h -- 
R5 = 000078B600h = 
LUF LV UF N Z V 

After Instruction: 



3,392,081 
7,910,912 
C = 0000000 



R1 = 000033C251 h = 3,392,081 

R5 = 00E21 D9600h = -501,377,536 

LUF LV UF N Z V C = 1 1 1 
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Multiply Integer, 3-Operand 



IV/1PYI3 



Syntax 

Operation 

Operands 



MPYI3 <src2>,<src1>,<dst> 

srcl X src2 -* dst 

srcl three-operand addressing modes (T): 
register (Rnl, < n1 ^ 27) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 

1 register (Rnl, ^ n1 ^ 27) 

11 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, ^ n2 < 27) 

1 register (Rn2, ^ n2 < 27) 

1 indirect (disp = 0, 1, IRO, IR1 ) 
11 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, < n < 27) 



Encoding 
31 



24 23 

T I I I r I I f 

10 10 10 



1615 



87 



-1 1 — I 1 — r 

src2 



I I i r 

dst 



T i I I I T 

srcl 



Description 



Cycles 
Status Bits 



Mode Bit 



The product of the srcl and src2 operands is loaded into the dst register. 
The src7 and src2 operands are assumed to be 24-bit signed integers. The 
result is assumed to be a signed 48-bit integer. The output to the dst reg- 
ister is the 32 least-significant bits of the result. 

Integer overflow occurs when any of the most-significant 16 bits of the 
48-bit result differs from the most-significant bit of the 32-bit output value. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C Unaffected. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unchanged. 



OVM Operation affected by OVM. 
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MPYI3 



Multiply Integer, 3-Operand 



Example 



Example 



MPYI 3 * AR4 , * -ARl ( 1 ) , R2 
Before Instruction: 

AR4 = 809850h 

ARl = 8098F3h 

R2 = Oh 

Data at 809850h = OADh = 173 

Data at 8098F2h = ODCh = 220 

LUFLVUFNZVC = 0000000 

After Instruction: 



AR4 = 809850h 

ARl = 8098F3h 

R2 - 094ACh = 38,060 

Data at 809850h = OADh = 173 

Data at 8098F2h = ODCh = 220 

LUF LVUFNZVC = 0000000 

MPYI3 * — AR4(IR0) ,R2,R7 
Before Instruction: 

AR4 = 8099F8h 

IRO = 8h 

R2 = 0C8h = 200 

R7 = Oh 

Data at 8099F0h = 32h = 50 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR4 = 8099F0h 

IRO = 8h 

R2 = 0C8h = 200 

R7 = 0271 Oh = 10,000 

Data at 8099F0h = 32h = 50 

LUFLVUFNZVC^OOOOOOO 
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Parallel MPYI3 and ADDIS 



MPYI3IIADDI3 



Syntax 
Operation 

Operands 



MPYI3 <srcA>,<srcB>,<dst1> 
ADDI3 <srcC>,<srcD>,<dst2> 



srcA X srcB 
1 1 srcD + srcC 



dsti 
dst2 



srcA 
srcB 
srcC 
srcD 



dsti 



dst2 



srcl 
src2 
src3 
src4 



Encoding 



Any two indirect (disp = 0,1,IR0,IR1) 
Any two register (0 <, ARn <, 7) 



register {d1): 

= RO 

1 = R1 

register {d2): 

= R2 

1 = R3 

register (Rn, < n < 7) 

register (Rn, < n ^ 7) 

indirect (disp = 0, 1, IRO, IR1 ) 

indirect (disp = 0, 1 , IRO, IR1 ) 

parallel addressing modes (0 < P ^ 3) 
OPERATION 

00 src3 X src4, srcl + src2 

01 src3 X srcl , src4 + src2 

10 srcl X src2, srcS + src4 

1 1 srcS X srcl , src2 + src4 



31 




24 23 






1615 87 


1 
1 


1 1 1 
10 


1 
P 


d1 


d2 


1 1 
srcl 


1 — [■ "" 
src2 


1 1 1 1 1 1 1 
src3 


1 1 1 1 1 1 1 
src4 
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MPYI3IIADDI3 



Parallel MPYI3 and ADDIS 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 




An integer multiplication and an integer addition are performed in parallel. 
All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (MPYI3) reads from 
a register and the operation being performed in parallel (ADDIS) writes to 
the same register, then MPYI3 accepts as input the contents of the register 
before it is modified by the ADDIS. 

Any combination of addressing modes may be coded for the four possible 
source operands as long as the two are coded as indirect and two are reg- 
ister. The assignment of the source operands srcA-srcD to the src1-src4 
fields varies depending on the combination of addressing modes used, and 
the P field is encoded accordingly. The assembler may, when not signif- 
icant, change the order of operands in commutative operations, in order to 
simplify processing. 

1 

N 

Z 

V 1 if an integer overflow occurs, otherwise. 

C Unaffected. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unchanged. 

OVM Operation affected by OVM. 

MPYI3 R7,R4,R0 
II ADDIS *-AR3,*AR5~(l) ,R3 

Before Instruction: 

R7 = 14h = 20 

R4 = 64h = 100 

RO = 0h 

ARS = 80981 Fh 

AR5 = 80996Eh 

RS = Oh 

Data at 80981 Eh = OFFFFFFCBh = -5S 

Data at 80996Eh = 35h = 5S 

LUF LVUFNZVC = 0000000 

After Instruction: 

R7 = 14h = 20 

R4 = 64h = 100 

RO = 07D0h = 2000 

ARS = 80981 Fh 

ARS = 80996Dh 

RS = Oh 

Data at 80981 Eh = OFFFFFFCBh = -5S 

Data at 80996Eh = 35h - 5S 

LUF LVUFNZVC = 0000000 
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Parallel MPYI3 and STI3 



MPYI3IISTI 



Syntax 



Operation 



Operands 



Encoding 



MPYI3 <src2>,<src1>,<dst1> 
STI <src3> ,<dst2> 



srcl X src2 
src3 -* dst2 



dsti 



srcl register (Rnl, ^ n1 <,!) 

src2 indirect (disp = 0, 1, IRO, IR1 ) 

dsti register (Rn3, ^ n3 ^ 7) 

src3 register (Rn4, ^ n4 <, 7) 

dst2 indirect (disp = 0, 1, IRO, IRI) 



31 




24 23 




1615 




87 







1 
1 1 


1 1 1 1 
10 


1 1 
dsti 


1 1 
srcl 


1 1 
src3 


1 1 1 1 1 1 1 
dst2 


1 1 1 1 J 1 1 
src2 



Description 



Cycles 
Status Bits 



Mode Bit 



An integer multiplication and an integer store are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STI) reads from a 
register and the operation being performed in parallel (MPYI3) writes to the 
same register, then STI accepts as input the contents of the register before 
it is modified by the MPYI3. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

Integer overflow occurs when any of the most-significant 16 bits of the 
48-bit result differs from the most-significant bit of the 32-bit output value. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C Unaffected. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 



OVM Operation affected by OVM. 
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MPYI3IISTI Parallel MPYI3 and STI3 



Example MPYI3 *++aR0 ( 1) ,R5 ,R7 

I I STI R2,*-AR3(1) 

Before Instruction: 

ARO = 80995Ah 

R5 = 32h = 50 

R7 =0h 

R2 = ODCh = 220 

AR3 = 80982Fh 

Data at 80995Bh = 0C8h = 200 

Data at 80982Eh = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

ARO = 80995Bh 

R5 = 32h = 50 

R7 = 271 Oh = 10000 

R2 = ODCh = 220 

AR3 = 80982Fh 

Data at 80995Bh = 0C8h = 200 

Data at 80982Eh = ODCh = 220 

LUFLVUFNZVC = 0000000 
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Parallel MPYI3 and SUBI3 



MPYI3IISUB[3 



Syntax 
Operation 

Operands 



MPYI3 <srcA>,<srcB>,<dst1> 
SUBI3 <srcC> <srcD> ,<dst2> 



srcA X srcB 
srcD - srcC - 



>■ dsti 
dst2 



srcA 
srcB 
srcC 
srcD 



Any two indirect (disp = 0,1,IR0,IR1) 
Any two register (0 ^ ARn <, 7) 



dsti 



register {d1): 

= RO 

1 = R1 



Encoding 



dst2 register {d2): 

= R2 

1 = R3 

srcl register (Rn, ^ n £ 7) 

src2 register (Rn, < n < 7) 

src3 indirect (disp = 0, 1, IRO, IR1 ) 

src4 indirect (disp = 0, 1, IRO, IR1) 

P parallel addressing modes (0 ^ P ^ 3) 

OPERATION 

00 src3 X src4, srcl - src2 

01 src3 X srcl , src4 - src2 

10 srcl X src2, srcS - src4 

1 1 src3 xsrcl, src2 - src4 



31 




24 23 






1615 




87 







1 
1 


1 1 1 
11 


1 
P 


d1 


d2 


1 1 
srcl 


1 f 
src2 


1 1 1 1 1 1 1 
src3 


1 1 1 1 1 1 1 
src4 



Description An integer multiplication and an integer subtraction are performed in par- 
allel. All registers are read at the beginning and loaded at the end of the 
execute cycle. This means that if one of the parallel operations (MPYI3) 
reads from a register and the operation being performed in parallel (SUBI3) 
writes to the same register, then MPYI3 accepts as input the contents of the 
register before it is modified by the SUBI3. 

Any combination of addressing modes may be coded for the four possible 
source operands as long as the two are coded as indirect and two are reg- 
ister. The assignment of the source operands srcA-srcD to the srcl -src4 
fields varies depending on the combination of addressing modes used, and 
the P field is encoded accordingly. The assembler may, when not signif- 
icant, change the order of operands in commutative operations, in order to 
simplify processing. 
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MPYI3IISUBI3 



Parallel MPYI3 and SUBI3 



Cycles 
Status Bits 



Mode Bit 
Example 



Integer overflow occurs when any of the most-significant 16 bits of the 
48-bit result differs from the most-significant bit of the 32-bit output value. 

1 

N 

Z 

V 1 if an integer overflow occurs, otherwise. 

C Unaffected. 

UF 1 if an integer underflow occurs, otherwise. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unchanged. 



OVM Operation affected by OVM. 

MPYI3 R2,*++AR0(1) ,R0 
II SUBI3 *AR5 — (IR1),R4,R2 
or 




MPYI3 *++ARO(l) ,R2,R0 
II SUBI3 *AR5 — (IR1),R4,R2 

Before Instruction: 



R2 = 32h = 50 






ARO = 8098E3h 






RO = Oh 






AR5 = 8099FCh 






IR1 = OCh 






R4 = 07D0h = 2000 






Data at 8098E4h = 62h = 


98 




Data at 8099FCh = 4B0h 


= 1200 




LUF LV UF N Z V C = 








After Instruction: 







R2 = 320h = 800 

ARO = 8098E4h 

RO = 01 324h = 4900 

AR5 = 8099F0h 

IR1 = OCh 

R4 - 07D0h = 2000 

Data at 8098E4h = 62h = 98 

Data at 8099FCh = 4B0h = 1200 

LUFLVUFNZVC = 0000000 
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Negative Integer with Borro\A/ 



NEGB 



Syntax 

Operation 

Operands 



NEGB <src>,<c/st> 

- src - C -* dst 

src general addressing modes (G): 
register (Rn, < n ^ 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n < 27) 



Encoding 



31 


24 23 






1615 


87 








10 110 


G 


dst 


src 



Description 



Cycles 
Status Bits 



Mode Bit 



Example 



Tlie difference of the 0, src, and C operands is loaded into the dst register. 
The dst and src are assumed to be signed integers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 



OVM Operation affected by OVM. 

NEGB R5,R7 
Before Instruction: 



R5 - OFFFFFFCBh = -53 

R7 = Oh 

LUF LV UF N Z V C = 1 

After Instruction: 

R5 = OFFFFFFCBh =-53 

R7 = 34h = 52 

LUF LV UF N Z V C = 1 
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NEGF 



Negate Floating-Point 



Syntax 


NEGF <src>,<dst> 


Operation 


- src -* dst 


Operands 


src general addressing modes (G) 
register (Rn, < n < 7) 

1 direct 

1 indirect 

1 1 immediate 




dst register (Rn, :S n < 7) 


Encoding 





31 



24 23 



1615 87 

I I I I I I I I I I I I 



10 111 



I I I r 

dst 



Description 



Cycles 
Status Bits 



l\/lode Bit 



Example 




The difference of the and src operands is loaded into the dst register. The 
dst and src operands are assumed to be floating-point numbers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

OVM Operation not affected by OVM. 

NEGF *++AR3(2),Rl 

Before Instruction: 

AR3 = 809800h 

R1 = 057B400025h = 6.281 25006e+01 
Data at 809802h = 70C8000h = 1 .4050e+02 
LUF LVUFNZVC = 0000000 

After Instruction: 

AR3 = 809802h 

R1 = 07F3800000h = -1.4050e+02 

Data at 809802h = 70C8000h = 1 .4050e+02 

LUF LVUFNZVC = 0001 000 
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Parallel NEGF and STF 



NEGFIISTF 



Syntax 



Operation 



Operands 



Encoding 

31 



NEGF <src2><dst1> 
II STF <src3> ,<dst2> 

- src2 -> dsti 
1 1 src3 -+ (^5^2 

src2 indirect (disp = 0, 1, IRO, IR1) 

dsti register (Rnl, < n1 ^7) 

src3 register (Rn2, ^ n2 :S 7) 

dst2 indirect (disp = 0, 1, IRO, IR1 ) 



24 23 



1615 



87 



T 1 1 1 

1110 1 



— 1 — I— 
src3 



I I 
dsti 







T I I I I I 

dst2 



I I I I I r 
src2 



Description 



Cycles 
Status Bits 



A floating-point negation and a floating-point store are performed in par- 
allel. All registers are read at the beginning and loaded at the end of the 
execute cycle. This means that if one of the parallel operations (STF) reads 
from a register and the operation being performed in parallel (NEGF) writes 
to the same register, then STF accepts as input the contents of the register 
before it is modified by the NEGF. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 



Mode Bit 



OVM Operation not affected by OVM. 
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NEGFIISTF Parallel NEGF and STF 



Example NEGF *AR4--(1),R7 

I I STF R2,*++AR5(1) 

Before Instruction: 

AR4 = 8098E1h 

R7 = Oh 

R2 = 0733C00000h = 1.79750e+02 

AR5 = 809803h 

Data at8098E1h = 57B400000h = 6.281 250e+01 

Data at 809804h = Oh 

LUFLVUFNZVC^OOOOOOO 

After Instruction: 

AR4 = 8098E0h 

R7 = 0584C00000h = -6.281 250e+01 

R2 = 0733C00000h = 1.79750e+02 

AR5 = 809804h 

Data at 8098E1h = 57B4000h = 6.281 250e+01 

Data at 809804h = 733C000h = 1.79750e+02 

LUF LV UF N Z V C = 1 
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Negate Integer 



NEGI 



Syntax 

Operation 

Operands 



NEGI <src>,<dst> 

- src -* dst 

src general addressing modes (G): 
register (Rn, ^ n < 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n < 27) 



Encoding 



31 


24 23 






1615 


87 





1 1 



1 1 1 1 1 
110 


1 

G 


1 J 1 J 
dst 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The difference of the and src operands is loaded into the dst register. The 
dst and src operands are assumed to be signed integers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

OVM Operation affected by OVM. 

NEGI 174, R5 (174 = 0AEh) 
Before Instruction: 

R5 = ODCh = 220 
LUFLVUFNZVC = 0000000 

After Instruction: 



R5 = 0FFFFFF52 =-174 

LUF LV UF N Z V C = 1 1 
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NEGIIISTI 



Parallel NEGI and STi 



Syntax 



Operation 



Operands 



Encoding 
31 



NEGI <src2>,<dst1> 
II STI <src3> ,<dst2> 

- src2 -* dst1 
\\src3 -» dstl 

src2 indirect (disp = 0, 1, IRO, IR1) 

dstl register (Rnl, ^ n1 ^7) 

src3 register (Rn2, ^ n2 ^ 7) 

dst2 indirect (disp = 0, 1, IRO, IR1) 



24 23 



1615 



T 1 1 1 

1110 10 



— I — I — 




-1 — r- 
src3 



T — I — I — I — I — r 
dst2 



87 

T — I — I — I — I — I — r 



I 1 
dstl 



src2 



Description 



Cycles 
Status Bits 



Mode Bit 



An integer negation and an integer store are performed in parallel. All reg- 
isters are read at the beginning and loaded at the end of the execute cycle. 
This means that if one of the parallel operations (STI) reads from a register 
and the operation being performed in parallel (NEGI) writes to the same 
register, then STI accepts as input the contents of the register before it is 
modified by the NEGI. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 



OVM Operation affected by OVM. 
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Parallel NEGI and STI NEGIIISTI 



Example NEGI *-AR3,R2 

I I STI R2,*AR1++ 

Before Instruction: 

AR3 = 80982Fh 

R2 = 19h = 25 

AR1 = 8098A5h 

Data at 80982Eh = ODCh = 220 

Data at 8098A5h = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR3 = 80982Fh 

R2 = 0FFFFFF24h = -220 

AR1 = 8098A6h 

Data at 80982Eh = ODCh = 220 

Data at 8098A5h = 19h - 25 

LUF LVUFNZVC = 0001 001 
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NOP 



No Operation 



Syntax 
Operation 

Operands 
Encoding 



NOP <src> 

No ALU or multiplier operations. 

ARn is modified if src is specified in indirect mode. 

src general addressing modes (G): 

register (no operation) 

1 indirect (modify ARn, ^ n ^7) 



31 


24 23 




1615 


87 








110 1 


G 





1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 




Example 



If the src operand is specified in the indirect mode, the specified addressing 
operation is performed and a dummy memory read occurs. If the src oper- 
and is omitted, no operation is performed. 



1 

N 

Z 
V 

c 

UF 
LV 
LUF 



Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 



OVM Operation not affected by OVM. 

NOP 

Before Instruction: 



PC = 


3Ah 


Afte 


r Instruction: 


PC = 


3Bh 


NOP 


*AR3~(1) 


Befo 


re Instruction: 


PC = 
AR3 


5h 
= 809900h 


After Instruction: 



PC = 6h 

AR3 = 8098FFh 
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Normalize 



NORM 



Syntax 

Operation 

Operands 



Encoding 
31 



NORM <src>,<dst> 

norm {src) -* dst 

src general addressing modes (G): 
register (Rn, £ n ^ 7) 

1 direct 

1 indirect 

1 1 immediate 



24 23 

— I — I — I — I — I — I — \ — I — 
110 10 



1615 



1 — r 
dst 



87 

1 — I — I — I — I — I — I — I — I — r 
src 



Description 



Cycles 
Status Bits 



Mode Bit 



Example 



The src operand is assumed to be an unnormalized floating-point number, 
i.e., the implied bit is set equal to the sign bit. The dst is set equal to the 
normalized src operand with the implied bit removed. The dst operand ex- 
ponent is set to the src operand exponent minus the size of the left-shift 
necessary to normalize the src. The dst operand is assumed to be a nor- 
malized floating-point number. 

If 5rc(exp) = -128 and s/-c(man) = 0, then dst = 0,Z = ^, and UF = 0. If 
5Ac(exp) = -128 and 5/-c(man) ?£ 0, then dst = 0, Z = 0, and UF = 1. For 
all other cases of the src, if a floating-point underflow occurs, then 
dst(vmr\) is forced to and dst{ex^) = -128. If src{man) = 0, then 
dst{man) = and dst{exp) = -128. Refer to Section 5.6. 



1 

N 

Z 
V 

c 

UF 
LV 



1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 



Unaffected. 

1 if a floating-point underflow occurs, otherwise. 

Unaffected. 



LUF 1 if a floating-point underflow occurs, unchanged otherwise. 
OVM Operation not affected by OVM. 

NORM R1,R2 
Before Instruction: 

R1 = 0400003AF5h 
R2 = 070C800000h 
LUFLVUFNZVC = 0000000 

After Instruction: 

R1 = 0400003AF5h 

R2 = F26BD40000h = 1.12451613e-04 

LUF LVUFNZVC = 0000000 
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NOT 



Bit>A/ise Logical-Complement 



Syntax 

Operation 

Operands 



NOT <src>,<dst> 
~src -* dst 

src general addressing modes (G): 

register (Rn, < n < 27) 

1 direct 

1 indirect 
1 1 immediate 

dst register (Rn, < n ^ 27) 



Encoding 



31 


24 23 






1615 


87 





1 1 



1 1 1 1 1 
110 11 


1 

G 


1 1 1 1 
dst 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
src 



Description The bitwise logical-complement of the src operand is loaded into the dst 
register. The complement is formed by a logical- NOT of each bit of the src 
operand. The dst and src operands are assumed to be unsigned integers. 

Cycles 1 



Status Bits 


N 


MSB of the output. 




Z 


1 if a zero output is generated, otherwise 




V 







c 


Unaffected. 




UF 







LV 


Unaffected. 




LUF 


Unaffected. 


IVIode Bit 


OVM 


Operation not affected by OVM. 




Example NOT @982Ch,R4 

Before Instruction: 

DP = 80h 

R4 = Oh 

Data at 80982Ch = 5E2Fh 

LUFLVUFNZVC = 0000000 

After Instruction: 



DP = BOh 

R4 = 0FFFFA1 DOh 

Data at 80982Ch = 5E2Fh 

LUF LV UF N Z V C = 1 
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Parallel NOT and STI 



NOTIISTI 



Syntax 



Operation 



Operands 



Encoding 



NOT <src2>,<dst1> 
STI <src3>,<dst2> 

~src2 -> dsti 
src3 ->■ dst2 



src2 indirect (disp = 0, 1 , IRO, IR1 ) 

dsti register (Rnl, ^ n1 ^7) 

src3 register (Rn2, £ n2 ^ 7) 

dst2 indirect (disp = 0, 1, IRO, IR1 ) 



31 




24 23 




1615 




87 







1 
1 1 


1 1 1 1 
10 11 


1 1 
dsti 





src3 


1 1 1 1 1 1 1 
dst2 


src2 



Description 



Cycles 
Status Bits 



A bitwise logical- NOT and an integer store are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STI) reads from a 
register and the operation being performed in parallel (NOT) writes to the 
same register, then STI accepts as input the contents of the register before 
it is modified by the NOT. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

1 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 

C Unaffected. 

UF 

LV Unaffected. 

LUF Unaffected. 



IVIode Bit 



OVM Operation not affected by OVM. 
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NOTIISTI 



Parallel NOT and STI 



Example NOT *+AR2,R3 

I I STI R7,*~AR4(IR1) 

Before Instruction: 

AR2 = 8099CBh 

R3 = Oh 

R7 = ODCh = 220 

AR4 = 809850h 

IR1 = lOh 

Data at 8099CCh = 0C2Fh 

Data at 809840h = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 



AR2 = 8099CBh 

R3 = 0FFFFF3D0h 

R7 = ODCh = 220 

AR4 = 809840h 

IR1 = lOh 

Data at 8099CCh = 0C2Fh 

Data at 809840h = ODCh = 220 

LUF LV UF N Z V C = 1 
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Bit\A/ise Logical-OR 



OR 



Syntax 

Operation 

Operands 



OR <src>,<c/st> 

dstOR src -* dst 

src general addressing modes (G): 
register (Rn, £ n ^ 27) 

1 direct 

1 indirect 

1 1 immediate (not sign-extended) 

dst register (Rn, :S n ^ 27) 



Encoding 



31 


24 23 






1615 


87 








1 1 1 1 1 
10 


1 

G 


1 1 1 1 
dst 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
src 



Description 



Cycles 



The bitwise logical-OR between the src and dst operands is loaded into the 
dst register. The dst and src operands are assumed to be unsigned integers. 



Status Bits 


N 


MSB of the output. 




Z 


1 if a zero output is generated, otherwise 




V 







c 


Unaffected. 




UF 







LV 


Unaffected. 




LUF 


Unaffected. 


JVIode Bit 


OVM 


Operation not affected by OVM. 
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OR Bitwise Logical-OR 



Example OR *++ARl ( IRl ) , R2 

Before Instruction: 



AR1 = 809800h 

IRl = 4h 

R2 = 012560000h 

Data at 809804h = 2BCDh 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR1 = 809804h 

IRl = 4h 

R2 = 012562BCDh 

Data at 809804h = 2BCDh 

LUFLVUFNZVC = 0000000 
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Bitwise Logical-OR, 3-Operand 



0R3 



Syntax 

Operation 

Operands 



0R3 <src2>,<src1>,<dst> 

src 7 OR src2 -* dst 

srcl three-operand addressing modes (T): 
register (Rnl, ^ n1 rS 27) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 

1 register (Rnl, ^ n1 ^ 27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, ^ n2 ^ 27) 

1 register (Rn2, ^ n2 £ 27) 

1 indirect (disp = 0, 1, IRO, IR1) 
1 1 indirect (disp = 0, 1, IRO, IRI) 

dst register (Rn, ^ n :S 27) 



Encoding 
31 



24 23 

T 1 1 T 1 1 1 1 

001001011 



1615 



87 



-1 — I — I — r 
srcl 



dst 



src2 



Description 

Cycles 
Status Bits 



The bitwise logical-OR between the srcl and src2 operands is loaded into 
the dst register. The srcl , src2, and dst operands are assumed to be un- 
signed integers. 



1 

N 

Z 
V 

c 

UF 
LV 



MSB of the output. 

1 if a zero output is generated, otherwise. 



Unaffected. 



Unaffected. 



Mode Bit 



LUF Unaffected. 

OVM Operation not affected by OVM. 
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OR3 Bitwise Logical-OR, 3-Operand 



Example 0R3 *++ARl(IRl) ,R2 ,R7 

Before Instruction: 

AR1 = 809800h 

IR1 = 4h 

R2 = 01 2560000h 

R7 =0h 

Data at 809804h = 2BCDh 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR1 = 809804h 

IR1 = 4h 

R2 = 012560000h 

R7 = 012562BCDh 

Data at 809804h = 2BCDh 

LUFLVUFNZVC = 0000000 
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Parallel OR3 and STI 



0R3IISTI 



Syntax 



Operation 



Operands 



Encoding 
31 



0R3 <src2>,<src1>,<dst1> 
II STI <src3> ,<dst2> 

srcl OR src2 -» dsti 
\\src3 -* dst2 

srcl register (Rnl, rs n1 <,!) 

src2 indirect (disp = 0, 1, IRO, IR1) 

dsti register (Rn2, ^ n2 ^ 7) 

src3 register (Rn3, ^ n3 ^ 7) 

dst2 indirect (disp = 0, 1, IRO, IR1) 



1 — I — : — I — 
1110 10 



24 23 

— I — r 
dsti 



1615 



87 



srcl 



src3 



dst2 



src2 



Description 



Cycles 
Status Bits 



A bitwise logical-OR and an integer store are performed in parallel. All re- 
gisters are read at the beginning and loaded at the end of the execute cycle. 
This means that if one of the parallel operations (STI) reads from a register 
and the operation being performed in parallel (0R3) writes to the same re- 
gister, then STI accepts as input the contents of the register before it is 
modified by the 0R3. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

1 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 

C Unaffected. 

UF 

LV Unaffected. 

LUF Unaffected. 



l\/lode Bit 



OVM Operation not affected by OVM. 
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OR3IISTI Parallel 0R3 and STI 



Example 0R3 *++AR2,R5,R2 

I I STI R6,*AR1 — 

Before Instruction: 



AR2 = 809830h 


















R5 = SOOOOOh 


















R2 = Oh 


















R6 = ODCh = 220 


















AR1 = 809883h 


















Data at 809831 h = 


9800h 
















Data at 809883h = 


Oh 
















LUF LV UF N Z 


V C = 























After Instruction 



















AR2 = 809831 h 

R5 = 800000h 

R2 = 809800h 

R6 = ODCh = 220 

AR1 = 809882h 

Data at 809831 h = 9800h 

Data at 809883h = ODCh = 220 

LUFLVUFNZVC = 0000000 
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POP Integer 



POP 



Syntax 
Operation 
Operands 
Encoding 



POP <dst> 
*SP-- -^ dst 
dst register (Rn, ^ n ^ 27) 



31 


24 23 






1615 87 





1110 


1 


dst 


0000000000000000 



Description 

Cycles 
Status Bits 



Mode Bit 



Example 



The top of the current system stack is popped and loaded into the dst reg- 
ister. The top of the stack is assumed to be a signed integer. The POP is 
performed with a post decrement of the stack pointer. 



1 

N 

Z 
V 

c 

UF 
LV 



1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 



Unaffected. 



Unaffected. 



LUF Unaffected. 

OVM Operation not affected by OVM. 



POP R3 

Before Instruction: 

SP = 809856h 

R3 = 012DAh = 4,826 

Data at 809856h = 0FFFF0DA4h = -62,044 

LUFLVUFNZVC = 0000000 

After Instruction: 

SP = 809855h 

R3 = 0FFFF0DA4h = -62,044 

Data at 809856h = 0FFFF0DA4h = -62,044 

LUF LV UF N Z V C = 1 
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POPF 



POP Floating-Point 



Syntax 
Operation 
Operands 
Encoding 



POPF <dst> 
*SP-- -* dsti 
dst register (Rn, < n < 7) 



31 


24 23 






1615 87 


1 1 



1 1 1 1 1 
1110 1 


1 
1 


1 1 1 1 
dst 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
0000000000000000 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 




The top of the current system stack is popped and loaded into the dst reg- 
ister. The top of the staci< is assumed to be a floating-point number. The 
POP is performed with a post decrement of the stack pointer. 



1 

N 

Z 
V 

c 

UF 
LV 

LUF Unaffected. 



1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 



Unaffected. 



Unaffected. 



OVM Operation not affected by OVM. 

POPF R4 

Before Instruction: 

SP = 80984Ah 

R4 = 025D2E0123h = 6.91186578e+00 

Data at 80984Ah = 5F2C1302h = 5.32544007e+28 

LUFLVUFNZVC = 0000000 

After Instruction: 

SP = 809849h 

R4 = 5F2C130200h = 5.32544007e+28 

Data at 80984Ah = 5F2C1302h = 5.32544007e+28 

LUFLVUFNZVC = 0000000 
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PUSH Integer 



PUSH 



Syntax 
Operation 
Operands 
Encoding 



PUSH <src> 
src -* * ++SP 
src register (Rn, < n < 27) 



31 


24 23 






1615 87 


1 1 



1 1 1 1 1 
11110 


1 


1 1 1 1 
src 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
0000000000000000 



Description 

Cycles 
Status Bits 



Mode Bit 



Example 



The contents of the src register are pushed on the current system stack . The 
src is assumed to be a signed integer. The PUSH is performed with a pre- 
increment of the stack pointer. 



1 

N 

Z 
V 

c 

UF 
LV 
LUF 



Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 



OVM Operation not affected by OVM. 

PUSH R6 

Before Instruction: 

SP = 8098AEh 

R6 = 815Bh = 33,115 

Data at 8098AFh = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

SP = 8098AFh 

R6 = 815Bh = 33,115 

Data at 8098AFh = 81 5Bh = 33,1 1 5 

LUFLVUFNZVC = 0000000 
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PUSHF 



PUSH Floating-Point 



Syntax 
Operation 
Operands 
Encoding 



PUSHF <src> 

src -* * + +SP 

src register (Rn, ^ n ^ 7) 



31 


24 23 






1615 87 


1 1 



1 1 1 1 1 
11111 


1 
1 


1 1 1 1 

src 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
0000000000000000 



Description The contents of the src register are pushed on the current system stack . The 
src is assumed to be a floating-point number. The PUSH is performed with 
a preincrement of the stack pointer. 

Cycles 1 



Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


Mode Bit 


OVM Operation not a 


Example 


PUSHF R2 




Before Instruction: 




SP = 809801 h 

R2 = 025C1 28081 h = 6.87725854e+00 
Data at 809802h = Oh 
LUFLVUFNZVC = 0000000 

After Instruction: 

SP = 809802h 

R2 = 025C1 28081 h = 6.87725854e+00 

Data at 809802h = 025C1 280h = 6.87725830e+00 

LUFLVUFNZVC = 0000000 
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Return From Interrupt Conditionally 



RET\cond 



Syntax 
Operation 



Operands 
Encoding 



REJlcond 

If cond is true: 
•SP-- -* PC 
1 -^ST(GIE). 

Else, continue. 

None 



31 


24 23 






1615 87 


1 1 1 1 
1111 


1 1 1 






1 1 1 1 
cond 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
0000000000000000 



Description 



Cycles 
Status Bits 



l\/Iode Bit 
Example 



A conditional return is performed. If the condition is true, the top of the 
stack is popped to the PC, and 1 is written to the global interrupt enable 
(GIE) bit of the status register. This has the effect of enabling all interrupts 
for which the corresponding interrupt enable bit is a 1 . 

The TI\/IS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 9.1 for a list of condition mnemonics, encoding, 
and flags). 



N 

Z 
V 

c 

UF 
LV 
LUF 



Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 



OVM Operation not affected by OVM. 



RETINZ 



Before Instruction: 



PC = 456h 

SP = 809830h 

ST = 0h 

Data at 809830h = 

LUF LV UF N Z 


123h 
V C = 




















After Instruction 


















PC = 123h 

SP = 80982Fh 

ST = 2000h 

Data at 809830h = 1 23h 

LUFLVUFNZVC = 0000000 
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RETScond 



Return From Subroutine Conditionally 



Syntax 
Operation 

Operands 
Encoding 



RETScond 

If cond is true: 

*SP-- - PC. 

Else, continue. 

None 



31 


24 23 






1615 87 


1 1 1 1 
1111 


1 i 1 
1 





1 1 1 1 
cond 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
0000000000000000 



Description 



Cycles 




A conditional return is performed. If the condition is true, the top of the 
stack is popped to the PC, and 1 is written to the global interrupt enable 
(GIE) bit of the status register. This has the effect of enabling all interrupts 
for which the corresponding interrupt enable bit is a 1 . 

The TMS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 9.1 for a list of condition mnemonics, encoding, 
and flags). 



Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


l\/lode Bit 


OVM 


Operation n 



Example RETSGE 

Before Instruction: 

PC = 123h 

SP = 80983Ch 

Data at 80983Ch = 456h 

LUF LV UF N Z V C = 

After Instruction: 







PC = 456h 

SP = 80983Bh 

Data at 80983Ch = 456h 

LUFLVUFNZVC = 0000000 
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Round Floating-Point 



RND 



Syntax 

Operation 

Operands 



RND <src><dst> 

rnd(sAc) -► dst 

src general addressing modes (G): 
register (Rn, :S n ^ 7) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n ^ 7) 



Encoding 



31 


24 23 






1615 


87 





1 1 



1 1 1 1 1 
10 10 


1 
G 


1 1 1 1 
dst 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
src 



Description 



Cycles 
Status Bits 



Mode Bit 



Example 



The result of rounding the src operand is loaded into the dst register. The 
src operand is rounded to the nearest single-precision floating-point value. 
If the src operand is exactly half-way between two single-precision values, 
it is rounded to the most positive of those values. 

1 

N 1 if a negative result is generated, otherwise, 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 



OVM Operation not affected by OVM. 

RND R5,R2 
Before instruction: 



R5 = 0733C16EEFh = 1.79755599e+02 

R2 - Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

R5 = 0733C16EEFh = 1 .79755599e+02 
R2 = 0733C16F00h = 1.79755600e+02 
LUFLVUFNZVC = 0000000 
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ROL 



Rotate Left 



Syntax 
Operation 
Operands 
Encoding 



ROL <dst> 

dst I eft -rotated 1 bit -♦ dst 

dst register (Rn, ^ n ^ 27) 



31 


24 23 






1615 87 


1 1 



1 1 1 1 1 
10 11 


1 

1 1 


1 1 1 1 
dst 


1 1 1 1 1 1 1 1 1 

0000000000000001 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 




The contents of the dst operand are left-rotated one bit and loaded into the 
dst register. This rotate is a circular rotate with the MSB transferred into the 
LSB. 

Rotate left: 

C *j dst 



*\ dst*^ 



MSB of the output. 

1 if a zero output is generated, otherwise. 


Set to the value of the bit rotated out of the high-order bit. Unaf- 
fected if dst is not R0-R7. 

Unaffected. 



1 

N 

Z 
V 

c 

UF 
LV 

LUF Unaffected. 

OVM Operation not affected by OVM. 

ROL R3 

Before Instruction: 

R3 = 80025CD4h 

LUFLVUFNZVC = 0000000 

After Instruction: 

R3 = 0004B9A9h 

LUF LVUFNZVC = 0000001 
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Rotate Left Through Carry 



ROLC 



Syntax 
Operation 
Operands 
Encoding 



ROLC <dst> 

dst left-rotated 1 bit through carry bit -♦ dst 

dst register (Rn, < n :S 27) 



31 


24 23 






1615 87 


1 1 



1 1 1 1 1 
10 10 


1 

1 1 


1 1 1 1 

dst 


1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 
0000000000000001 



Description 



The contents of the dst operand are left-rotated one bit through the carry 
bit and loaded into the dst register. The MSB is rotated to the carry bit, at 
the same time the carry bit is transferred to the LSB. 

Rotate left through carry bit: 
dst 



f- C^ dst*-\ 



Cycles 
Status Bits 



Mode Bit 
Example 



Example 



N 

Z 
V 

c 

UF 
LV 
LUF 



MSB of the output. 

1 if a zero output is generated, otherwise. 



Set to the value of the bit rotated out of the high-order bit. If dst is 

not R0-R7, then C is shifted into the dst but not changed. 



Unaffected. 

Unaffected. 



OVM Operation not affected by OVM. 

ROLC R3 

Before Instruction: 

R3 = 00000420h 

LUF LV UF N Z V C = 1 

After Instruction: 

R3 = 000000841 h 
LUFLVUFNZVC = 0000000 

ROLC R3 

Before Instruction: 




R3 = 80004281 h 

LUF LV UF N Z V C 

After Instruction: 

R3 = 00008502h 

LUF LV UF N Z V C 







1 
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ROR 



Rotate Right 



Syntax 
Operation 
Operands 
Encoding 



ROR <dst> 

dst right-rotated 1 bit through carry bit -♦ dst 

dst register (Rn, ^ n ^ 27) 



31 


24 23 






1615 87 





10 10 1 


1 1 


dst 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1111111111111111 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 




The contents of the dst operand are right-rotated one bit and loaded into 
the dst register. The LSB is rotated into the carry bit and also transferred 
into the MSB. 

Rotate right: 
dst 



|-» dst -I* 



N 

Z 
V 

c 

UF 
LV 
LUF 



MSB of the output. 

1 if a zero output is generated, otherwise. 


Set to the value of the bit rotated out of the low-order bit. Unaf- 
fected if dst is not R0-R7. 


Unaffected. 
Unaffected. 



OVM Operation not affected by OVM. 

ROR R7 

Before Instruction: 

R7 = 00000421 h 

LUFLVUFNZVC = 0000000 

After Instruction: 

R7 = 8000021 Oh 

LUF LV UF N Z V C = 1 1 
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Rotate Right Through Carry 



RORC 



Syntax 
Operation 
Operands 
Encoding 



RORC <dst> 

dst right- rotated 1 bit through carry bit -* dst 

dst register (Rn, £ n ^ 27) 



31 


24 23 






1615 87 


1 1 



1 1 1 1 1 
10 110 


1 

1 1 


1 1 1 1 
dst 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 
1111111111111111 



Description 



The contents of the dst operand are right-rotated one bit through the carry 
bit and loaded into the dst register. The LSB is rotated into the carry bit. 
At the same time, the carry bit is transferred into the MSB. 

Rotate right through carry bit: 

C -» dst 



r-> L -♦ dst -h 



Cycles 


1 




Status Bits 


N 

Z 
V 

c 

UF 
LV 
LUF 


iVISB of the output. 

1 if a zero output is generated, otherwise. 



Set to the value of the bit rotated out of the low-order bit 

not R0-R7, then C is shifted in but not changed. 



Unaffected. 

Unaffected. 


Mode Bit 


OVM 


Operation not affected by OVM. 


Example 


RORC 


R4 



If dst is 



Before Instruction: 

R4 = 00000081 h 

LUF LV UF N Z V C = 1 

After Instruction: 



R4 = 80000040h 

LUF LV UF N Z V C = 1 1 
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RPTB 



Repeat Block 



Syntax 
Operation 

Operands 
Encoding 



RPTB <src> 

src -* RE 
1 - ST(RM) 
Next PC -♦ RS 

src long-immediate addressing mode 



31 24 23 


1615 




87 





1 1 1 1 1 1 1 
110 10 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

src 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 




RPTB allows a block of instructions to be repeated a number of times 
without any penalty for looping. This instruction activates the block repeat 
mode of updating the PC. The src operand is a 24-bit unsigned immediate 
value that is loaded into the repeat end address (RE) register. A 1 is written 
into the repeat mode bit of status register ST(RM) to indicate that the PC 
is being updated in the repeat mode. The address of the next instruction is 
loaded into the repeat start address (RS) register. 



N 

Z 
V 

c 

UF 
LV 
LUF 



Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 



OVM Operation not affected by OVM. 
RPTB 12 7h 
Before Instruction: 

PC = 123h 

ST = Oh 

RE = Oh 

RS = 0h 

LUF LVUFNZVC = 0000000 

After Instruction: 

PC = 124h 
ST = 1 0Oh 
RE = 127h 
RS = 124h 
LUFLVUFNZVC = 0000000 
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Repeat Single 



RPTS 



Syntax 
Operation 



Operands 



Encoding 



RPTS <src> 

src -^ RC 
1 -* ST(RM) 
1 -S 

Next PC - RS 
Next PC - RE 

src general addressing modes (G): 
register 

1 direct 

1 indirect 

1 1 immediate 



31 


24 23 




1615 


87 








1 1 1 1 1 
10 111 


1 
G 


1 1 1 1 
110 11 


1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 
src 



Description 



Cycles 



The RPTS instruction allows a single instruction to be repeated a number 
of times without any penalty for looping. Fetches can also be made from the 
instruction register (IR), thus avoiding repeated memory access. 

The src operand is loaded into the repeat counter (RC). A 1 is written into 
the repeat mode bit of the status register ST(RM). A 1 is also written into 
the repeat single bit (S). This indicates that the program fetches are to be 
performed only from the instruction register, the repeat single mode. The 
next PC is loaded into the repeat end address (RE) register and the repeat 
start address (RS) register. 

The src operand is assumed to be an unsigned integer and is not sign-ex- 
tended for immediate mode. 



Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


l\/lode Bit 


OVM 


Operation n 
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RPTS Repeat Single 

Example RPTS AR5 

Before Instruction: 

PC = 123h 

ST = Oh 

RS = 0h 

RE = 0h 

RC = 0h 

AR5 = OFFh 

LUFLVUFNZVC = 0000000 

After Instruction: 



PC = 124h 

ST = 1 0Oh 

RS = 124h 

RE ^ 124h 

RC = OFFh 

AR5 = OFFh 

LUFLVUFNZVC = 0000000 
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Signal, Interlocked 



SIGI 



Syntax 
Operation 

Operands 
Encoding 



SIGI 

Signal Interlocked operation. 
Wait for interlock acknowledge. 
Clear interlock. 

None 



31 


24 23 




1615 87 





10 110 





000000000000000000000 



Description An interlocked operation is signaled over XFO and XF1 . After the interlocked 
operation is acknowledged, the interlocked operation ends. SIGI ignores 
the external ready signals. Refer to Section 7.3 for detailed information. 



Cycles 



1 



Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


/VIode Bit 


OVM 


Operation n 
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STF 



Store Floating-Point 



Syntax 

Operation 

Operands 



Encoding 
31 



STF <src>,<dst> 

src -» dst 

src register (Rn, ^ n ^ 7) 

dst general addressing modes (G): 

1 direct 

1 indirect 



24 23 



1615 



87 



— I — I — I — I — I — I — I — I — 
1 1 



T — I — \ — r 
src 



-1 — I — I — I — I — I — I — I — \ — I — I — I — I — I — r 
dst 



Description 

Cycles 
Status Bits 



l\/lode Bit 
Example 




The src register is loaded into the dst memory location. The src and dst 
operands are assumed to be floating-point numbers. 



1 

N 

Z 
V 

c 

UF 
LV 
LUF 



Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 



OVM Operation not affected by OVM. 

STF R2,@98Alh 

Before Instruction: 

DP = 80h 

R2 = 052C501900h = 4.30782204e+01 
Data at 8098A1 h = Oh 
LUFLVUFNZVC = 0000000 

After Instruction: 

DP = 80h 

R2 = 052C501900h = 4.30782204e+01 

Data at 8098A1h = 52C5019h = 4.30782204e+01 

LUFLVUFNZVC = 0000000 
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Store Floating-Point, Interiociced 



STFI 



Syntax 
Operation 

Operands 



Encoding 



STFI <src>,<dst> 

src -*■ dst 

Signal end of interlocked operation. 

src register (Rn, ^ n :S 7) 

dst genera! addressing modes (G): 

1 direct 

1 indirect 



31 


24 23 






1615 


87 





1 1 



1 1 1 1 1 
10 10 1 


1 
G 


1 1 1 1 
src 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
dst 



Description The src register is loaded into the dst memory location. An interlocked op- 
eration is signaled over pins XFO and XF1 . The src and dst operands are 
assumed to be floating-point numbers. Refer to Section 7.3 for detailed 
information. 



Cycles 



1 



Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UP 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


Mode Bit 


OVM 


Operation not a 


Example 


STFI 


R3,*-AR4 




Before Instruction: 



R3 = 0733C00000h = 1.79750e+02 
AR4 = 80993Ch 
Data at 80993Bh = Oh 
LUFLVUFNZVC = OO00O00 

After instruction: 




R3 = 0733C00000h = 1.79750e+02 

AR4 = 80993Ch 

Data at 80993Bh = 733C000h = 1.79750e+02 

LUFLVUFNZVC = 0000000 
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STFIiSTF 



Parallel STF and STF 



Syntax 

Operation 

Operands 

Encoding 



STF <src2> ,<dst2> 
STF <src1><dst^> 



src2 
srcl 



dstl 
dst^ 



srcl register (Rnl, ^ n1 <, 7) 

dstl indirect (disp = 0, 1, IRO, IR1) 

src2 register (Rn2, ^ n2 < 7) 

dst2 indirect (disp = 0, 1 , IRO, IR1 ) 



31 




24 23 




1615 




87 







1 
1 1 


1 1 1 1 



1 1 
src2 


1 1 



1 1 
srcl 


1 1 1 1 1 1 1 
dst1 


1 1 1 1 1 1 1 
dst2 



Description Two floating-point stores are performed in parallel. If both stores are exe- 
cuted to the same address, the value written is that of STF 
<src2> ,<dst2> . 

Cycles 1 



Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


Mode Bit 


OVM 


Operation n 
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Parallel STF and STF STF||STF 



Example STF R4,*AR3 — 

I I STF R3,*++AR5 

Before Instruction: 



R4 - 070C800000h = 1.4050e+02 

AR3 = 809835h 

R3 = O733C00000h = 1.79750e+02 

AR5 = 8099D2h 

Data at 809835h = Oh 

Data at 8099D3h = Oh 

LUFLVUFNZVC^OOOOOOO 

After Instruction: 



R4 = 070C800000h = 1.4050e+02 

AR3 = 809834h 

R3 = 0733C00000h = 1.79750e+02 

AR5 = 8099D3h 

Data at 809835h = 070C8000h = 1 .4050e+02 

Data at 8099D3h = 0733C000h = 1.79750e+02 

LUFLVUFNZVC = 0000000 
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STI 



Store Integer 



Syntax 

Operation 

Operands 



Encoding 



STI <src>.<dst> 

src -* dst 

src register (Rn, :S n ^ 27) 

dst general addressing modes (G): 

1 direct 

1 indirect 



31 


24 23 






1615 


87 








10 10 10 


G 


src 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
dst 



Description 



Cycles 
Status Bits 



Mode Bit 



Example 




The src register is loaded into the dst memory location. The src and dst 
operands are assumed to be signed integers. 



1 

N 

Z 
V 

c 

UF 
LV 
LUF 



Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 



OVM Operation not affected by OVM. 
STI R4,(a982Bh 
Before Instruction: 



DP = 80h 

R4 = 42BD7h = 273,367 

Data at 80982Bh = OEBFCh = 58,876 

LUFLVUFNZVC = 0000000 

After Instruction: 



DP = 80h 

R4 = 42BD7h = 273,367 

Data at 80982Bh = 42BD7h = 273,367 

LUFLVUFNZVC = 0000000 
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Store integer. Interlocked 



STII 



Syntax 
Operation 

Operands 



Encoding 



STII <src>,<dst> 

src -*■ dst 

Signal end of Interlocked operation. 

src register (Rn, £ n ^ 27) 

dst general addressing modes (G): 
1 direct 
1 indirect 



31 


24 23 






1615 


87 








1 1 1 1 1 
10 10 11 


1 
G 


1 1 1 1 
src 


1 1 1 1 1 1 1 1 i I J 1 I 1 1 
dst 



Description The src register is loaded into the dst memory location. An interlocked op- 
eration is signaled over pins XFO and XF1 . The src and dst operands are 
assumed to be signed integers. Refer to Section 7.3 for detailed information. 

Cycles 1 



Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


Mode Bit 


OVM 


Operation n 



Example STII Rl,(a98AEh 

Before Instruction: 

DP = 80h 

R1 = 78Dh 

Data at 8098AEh = 25Ch 

After Instruction: 

DP = 80h 

R1 = 78Dh 

Data at 8098AEh = 7BDh 
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STHISTI 



Parallel STI and STI 



Syntax 

Operation 

Operands 

Encoding 



STI <src2> ,<dst2> 
II STI <src7>,<dsn> 

src2 -+ dstl 
\\src1 -* dst[ 

srcl register (Rnl, :S n1 ^7) 

dst^ indirect (disp = 0, 1, I RO, I R1) 

src2 register (Rn2, ^ n2 ^ 7) 

dstl indirect (disp = 0, 1, IRO, IR1) 



31 




24 23 




1615 




87 







1 
1 1 


1 1 1 1 
1 


1 1 
src2 


1 1 



1 1 
srcl 


1 1 1 1 1 1 1 
dstl 


1" 1 1 1 1 1 1 
dst2 



Description 



Cycles 
Status Bits 



Two integer stores are performed in parallel. If both stores are executed to 
the same address, the value written is that of STI <src2> <dst2> . 



1 

N 

Z 
V 

c 

UF 
LV 
LUF 



Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 



Mode Bit 



OVM Operation not affected by OVM. 
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Parallel STI and STI STI||STI 



Example STI RO, *++AR2 (IRO) 

I I STI R5,*AR0 

Before Instruction: 



RO = ODCh = 220 




AR2 = 809830h 




IRO = 8h 




R5 = 35h = 53 




ARO = 8098D3h 




Data at 809838h = Oh 




Data at 8098D3h = Oh 




LUF LV UF N Z V C = 


= 0000000 


After Instruction: 





RO = ODCh = 220 

AR2 = 809838h 

IRO = 8h 

R5 = 35h = 53 

ARO = 8098D3h 

Data at 809838h = ODCh = 220 

Data at 8098D3h = 35h = 53 

LUFLVUFNZVC = 0000000 
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SUBB 



Subtract Integer \rj\th Borrow 



Syntax 

Operation 

Operands 



SUBB <src>,<dst> 
dst - src - C -* dst 

src general addressing modes (G): 

register (Rn, ^ n ^ 27) 

1 direct 

1 indirect 
1 1 immediate 

dst register (Rn, ^ n ^ 27) 



Encoding 



31 


24 23 






1615 


87 





1 1 



1 1 1 1 1 
10 110 1 


1 

G 


1 1 1 1 
dst 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
src 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 




The difference of the dst, src, and C operands is loaded into the dst register. 
The dst and src operands are assumed to be signed integers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

OVM Operation affected by OVM. 

SUBB *AR5++(4),R5 
Before Instruction: 

AR5 = 809800h 

R5 = OFAh = 250 

Data at 809800h = 0C7h = 1 99 

LUF LVUFNZVC = 0000001 

After Instruction: 

AR5 = 809804h 

R5 = 032h = 50 

Data at 809800h = 0C7h = 199 

LUFLVUFNZVC = 0000000 
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Subtract Integer with Borro\A/, 3-Operand 



SUBB3 



Syntax 

Operation 

Operands 



SUBB3 <src2>,<src7>,<dst> 

srcl - src2 - C -* dst 

srcl three-operand addressing modes (T): 
register (Rnl, <, n^ <, 27) 

1 indirect (disp = 0, 1, IRO, IR1) 

1 register (Rnl, ^ n1 ^ 27) 
11 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, ^ n2 ^ 27) 

1 register (Rn2, ^ n2 £ 27) 
10 indirect (disp = 0, 1, IRO, IR1) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, ^ n ^ 27) 



Encoding 
31 



24 23 

— 1 — I — I — I — I — I — I — I — 
10 110 



1615 



87 



T — I — I — r 



1 — I — I — r 



dst 



srcl 



src2 



Description 



Cycles 
Status Bits 



Mode Bit 



The difference of the srcl and src2 operands and the C (carry) flag is 
loaded into the dst register. The src7, src2, and dst operands are assumed 
to be signed integers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 



OVM Operation affected by OVM. 
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SUBB3 Subtract Integer with Borrow, 3-Operand 

Example SUBB3 R5 , *AR5++ ( IRO ) , RO 

Before Instruction: 

AR5 = 809800h 

IRO = 4h 

R5 = 0C7h = 199 

RO = Oh 

Data at 809800h = OFAh = 250 

LUF LVUFNZVC = 0000001 

After Instruction: 

AR5 = 809804h 

IRO = 4h 

R5 = 0C7h = 199 

RO = 32h = 50 

Data at 809800h = OFAh = 250 

LUFLVUFNZVC = 0000000 
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Subtract Integer Conditionally 



SUBC 



Syntax 
Operation 



Operands 



SUBC <src>,<c/st> 

If {dst - src >0): 

{dst - src << 1 ) OR 1 -^ dst 
Else: 

dst << ^ ->■ dst 

src general addressing modes (G): 
register (Rn, < n ^ 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n :^ 27) 



Encoding 



31 


24 23 






1615 


87 








10 1110 


G 


dst 


src 



Description 



Cycles 



A subtraction of the src operand from the dst operand is performed. The 
dst operand is loaded with a value dependent upon the result of the sub- 
traction. If {dst - src) is greater than or equal to zero, then {dst - src) is 
left-shifted one bit, the least-significant bit is set to 1, and the result is 
loaded into the dst register. If {dst - src) is less than zero, dst is left-shifted 
one bit and loaded into the dst register. The dst and src operands are as- 
sumed to be unsigned integers. 

SUBC may be used to perform a single step of a multi-bit integer division. 
See Section 12.3.3 for a detailed description. 



Status Bits 


N 


Unaffected 




Z 


Unaffected 




V 


Unaffected 




c 


Unaffected 




UF 


Unaffected 




LV 


Unaffected 




LUF 


Unaffected 


IVIode Bit 


OVM 


Operation r 
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SUBC 



Subtract Integer Conditionally 



Example 



Example 



SUBC @98C5h,Rl 
Before Instruction: 

DP = 80h 

R1 =04F6h = 1270 

Data at 8098C5h = 492h = 1170 

LUFLVUFNZVC = 0000000 

After instruction: 

DP = BOh 

R1 = 0C9h = 201 

Data at 8098C5h = 492h = 1 170 

LUFLVUFNZVC = 0000000 

SUBC 3000, RO (3000 = 0BB8h) 

Before Instruction: 

RO = 07D0h = 2000 
LUFLVUFNZVC = 0000000 

After Instruction: 

RO = OFAOh = 4000 
LUFLVUFNZVC = 0000000 
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Subtract Floating-Point 



SUBF 



Syntax 

Operation 

Operands 



SUBF <src>,<cfst> 

dst - src -* dst 

src general addressing modes (G): 
register (Rn, ^ n ^ 7) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n ^ 7) 



Encoding 



31 


24 23 






1615 


87 








1 1 1 1 1 
10 1111 


1 
G 


1 1 1 1 
dst 


1 1 i 1 1 1 I 1 1 1 1 1 1 1 1 
src 



Description 



Cycles 
Status Bits 



IVIode Bit 



Example 



The dst operand minus the src operand is loaded into the dst register. The 
dst and src operands are assumed to be floating-point numbers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an floating-point overflow occurs, otherwise. 

C Unaffected 

UF 1 if a floating-point underflow occurs, otherwise. 

LV 1 if an floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

OVM Operation not affected by OVM. 

SUBF *ARO — ( IRO ) , R5 

Before Instruction: 

ARO = 809888h 

IRO - 80h 

R5 = 0733C00000h = 1.79750000e+02 

Data at 809888h = 70C8000h = 1 .4050e+02 

LUFLVUFNZVC = 0000000 

After Instruction: 

ARO = 809808h 

IRO = 80h 

R5 = 051 DOOOOOOh = 3.9250e+01 

Data at 809888h = 70C8000h = 1 .4050e+02 

LUFLVUFNZVC = 0000000 
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SUBF3 



Subtract Floating-Point, 3-Operand 



Syntax 

Operation 

Operands 



Encoding 



SUBF3 <src2><src1>,<dst> 

srcl - src2 -* dst 

srcl three-operand addressing modes (T): 
register (Rnl, ^ n1 ^7) 

1 indirect (disp = 0, 1, IRO, IR1) 

1 register (Rnl, ^ n1 < 7) 

11 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, ^ n2 ^ 7) 

1 register (Rn2, ^ n2 < 7) 

10 indirect (disp = 0, 1, IRO, IR1) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, <, n <,!) 



31 


24 23 






1615 




87 







1 


1 1 1 1 1 
110 1 


1 

T 


1 1 1 1 
dst 


1 1 1 1 I 1 1 
srcl 


1 1 1 1 1 1 1 
src2 



Description 

Cycles 
Status Bits 



IVIode Bit 



The difference of the src7 and src2 operands is loaded into the dst register. 
The srcl, src2, and dst operands are assumed to be floating-point numbers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

UF 1 if a floating-point underflow occurs, otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

OVM Operation not affected by OVM. 
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Subtract Floating-Point, 3-Operand 



SUBF3 



Example 



Example 



SUBF3 *ARO — ( IRO ) , *AR1 , R4 
Before Instruction: 

ARO = 809888h 

IRO = 80h 

AR1 = 809851 h 

R4 = Oh 

Data at 809888h = 70C8000h - 1 .4050e+02 

Data at 809851 h = 733C000h = 1.79750e+02 

LUF LVUFNZVC = 0000000 

After Instruction: 

ARO = 809808h 

IRO = 80h 

AR1 = 809851 h 

R4 = 51 DOOOOOOh = 3.9250e+01 

Data at 809888h = 70C8000h = 1.4050e+02 

Data at 809851 h = 733C000h = 1 .79750e+02 

LUF LVUFNZVC = 0000000 

SUBF3 R7,R0,R6 
Before Instruction: 



R7 = 57B400000h = 6.281 250e+01 

RO = 34C200000h = 1. 275781 25e+01 

R6 = 0h 

LUF LVUFNZVC = 0000000 

After Instruction: 

R7 = 57B400000h = 6.281 250e+01 
RO = 34C200000h = 1. 275781 25e+01 
R6 = 5B7C80000h = -5.00546875e+01 
LUF LV UF N Z V C = 1 
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SUBF3IISTF 



Parallel SUBF3 and STF 



Syntax 



Operation 



Operands 



Encoding 



SUBF3 <src1>,<src2>,<dst^> 
II STF <src3> ,<dst2> 

src2 - srcl -* dst^ 
\\src3 -* dstl 

src1 register (Rnl, ^ n1 ^7) 

src2 indirect (disp = 0, 1, I RO, I R1) 

dstl register (Rn2, s n2 ^ 7) 

src3 register (Rn3, ^ n3 ^ 7) 

dst2 indirect (disp = 0, 1, IRO, IR1) 



31 






24 23 




1615 




8 




1 


1 


10 1 


dstl 


srcl 


1 1 
src3 


1 1 1 1 1 1 1 
dst2 


1 1 1 1 1 1 " 1 
src2 



Description 



Cycles 
Status Bits 



A floating-point subtraction and a floating-point store are performed in 
parallel. All registers are read at the beginning and loaded at the end of the 
execute cycle. This means that if one of the parallel operations (STF) reads 
from a register and the operation being performed in parallel (SUBF3) 
writes to the same register, then STF accepts as Input the contents of the 
register before it is modified by the SUBF3. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

UP 1 if a floating-point underflow occurs, otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 



Mode Bit 



OVM Operation not affected by OVM. 
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Parallel SUBF3 and STF SUBF3IISTF 



Example SUBF3 Rl, *-AR4 (IRl) ,R0 

I I STF R7,*+AR5(IR0) 

Before Instruction: 

Rl = 057B400000h = 6.281 25e+01 

AR4 = 8098B8h 

IRl = 8h 

RO = Oh 

R7 = 0733C00000h = 1 .79750e+02 

AR5 = 809850h 

IRO = lOh 

Data at 8098B0h = 70C8000h = 1.4050e+02 

Data at 809860h = Oh 

LUF LVUFNZVC = 0000000 

After Instruction: 

Rl = 057B400000h = 6.281 25e+01 

AR4 = 8098B8h 

IRl = 8h 

RO = 061 B600000h = 7.768750e+01 

R7 = O733C00O00h = 1.79750e+02 

AR5 = 809850h 

IRO = lOh 

Data at 8098B0h = 70C8000h = 1.4050e+02 

Data at 809860h = 733C000h - 1.79750e+02 

LUFLVUFNZVC = 0000000 
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SUB! 



Subtract Integer 



Syntax 

Operation 

Operands 



SUBI <src>,<dst> 
dst - src -* dst 



src 



general addressing modes (G): 
register (Rn, < n ^ 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, £ n ^ 27) 



Encoding 



31 


24 23 






1615 


87 





1 1 



1 1 1 1 1 
110 


1 

G 


1 1 1 1 
dst 


1 1 1 1 1 1 1 "1 1 "1 

src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 




The dst operand minus the src operand is loaded into the dst register. The 
dst and src operands are assumed to be signed integers. 



1 

N 

Z 
V 

c 

UF 
LV 



1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 

1 if an integer overflow occurs, otherwise. 

1 if a borrow occurs, otherwise. 



1 if an integer overflow occurs, unchanged otherwise. 



LUF Unaffected. 

OVM Operation affected by OVM. 

SUBI 220, R7 
Before Instruction: 



R7 = 226h = 550 
LUF LV UF N Z V C 

After Instruction: 







R7 = 14Ah = 330 

LUFLVUFNZVC = 0000000 
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Subtract Integer, 3-Operand 



SUBI3 



Syntax 

Operation 

Operands 



SUBI3 <src2>,<src1>,<dst> 

srcl - src2 -* dst 

srcl three-operand addressing modes (T): 
register (Rnl, ^ n1 ^ 27) 

1 indirect (disp = 0, 1, IRO, IR1) 

1 register (Rnl, ^ n1 £ 27) 
11 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, ^ n2 ^ 27) 

1 register (Rn2, ^ n2 < 27) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 
11 indirect (disp = 0, 1, IRO, I R1) 

dst register (Rn, ^ n i^ 27) 



Encoding 



31 


24 23 






1615 




87 







1 1 
1 


1110 


T 


1 1 1 1 
dst 


1 1 i- 1 1 ■ I — 
srcl 


1 1 1 — r r--! — 1 — 
src2 



Description 

Cycles 
Status Bits 



l\/lode Bit 
Example 



The srcl operand minus the src2 operand is loaded into the dst register. The 
srcl , src2, and dst operands are assumed to be signed integers. 



1 

N 

Z 
V 

c 

UF 
LV 



1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 

1 if an integer overflow occurs, otherwise. 

1 if a borrow occurs, otherwise. 



1 if an integer overflow occurs, unchanged otherwise. 



LUF Unaffected. 

OVM Operation affected by OVM. 

SUBI3 R7,R2,R0 

Before Instruction: 



R2 = 0866h = 2150 

R7 = 0834h = 2100 

RO = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

R2 = 0866h = 2150 

R7 = 0834h = 21 00 

RO = 032h = 50 

LUF LV UF N Z V C = 1 
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SUBI3 Subtract Integer, 3-Operand 

Example SUBI3 *-AR2 ( 1) ,R4,R3 

Before Instruction: 

AR2 = 80985Eh 

R4 = 0226h = 550 

R3 = 0h 

Data at 80985Dh = ODCh = 220 

LUFLVUFNZVC = 0000000 

After Instruction: 




AR2 = 80985Eh 

R4 = 0226h = 550 

R3 = 014Ah = 330 

Data at 80985Dh = ODCh = 220 

LUFLVUFNZVC = 0000000 
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Parallei SUBI3 and STI 



SUBI3IISTI 



Syntax 



Operation 



Operands 



Encoding 



SUBI3 <src1><src2>,<dst^> 
STI <src3> ,<dst2> 



src2 
\\src3 

srcl 
src2 
dst1 
src3 
dst2 



srcl 
dstl 



dsn 



register (Rnl, ^ n1 ^7) 
indirect (disp = 0, 1, IRO, IR1) 
register (Rn2, ^ n2 ^ 7) 
register (Rn3, ^ n3 ^ 7) 
indirect (disp = 0, 1, IRO, IR1) 



31 




24 23 




1615 




87 







1 1 


10 110 


dstl 


srcl 


src3 


1 1 1 1 1 1 1 
dst2 


src2 



Description 



Cycles 
Status Bits 



An integer subtraction and an integer store are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STI) reads from a 
register and the operation being performed in parallel (SUBI3) writes to the 
same register, then STI accepts as input the contents of the register before 
it is modified by the SUBI3. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 



Mode Bit 



OVM Operation affected by OVM. 
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SUBI3IISTI Parallel SUBI3 and STI 



Example SUBI3 R7,*+AR2 (IRO) ,R1 

I I STI R3,*++AR7 

Before Instruction: 



R7 = 14h = 20 

AR2 = 80982Fh 

IRO = lOh 

R1 = Oh 

R3 = 35h = 53 

AR7 = 80983Bh 

Data at 80983Fh = ODCh = 220 

Data at 80983Ch = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 



R7 = 14h = 20 

AR2 = 80982Fh 

IRO = lOh 

R1 = 0C8h = 200 

R3 = 35h = 53 

AR7 = 80983Ch 

Data at 80983Fh = ODCh = 220 

Data at 80983Ch = 35h = 53 

LUFLVUFNZVC = 0000000 
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Subtract Reverse Integer \A/ith Borrow 



SUBRB 



Syntax 

Operation 

Operands 



SUBRB <src>,<dst> 

src - dst - C -> dst 

src general addressing modes (G): 
register (Rn, ^ n ^ 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, < n £ 27) 



Encoding 
31 



24 23 



1615 



87 



— I — 1 — I — I — I — I — I — I — 
110 1 



n I I r 

dst 



T I I r 



Description 



Cycles 
Status Bits 



Mode Bit 



Example 



The difference of the src, dst, and C operands is loaded into the dst register. 
The dst and src operands are assumed to be signed integers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 



OVM Operation affected by OVM. 
SUBRB R4,R6 
Before Instruction: 



R4 = 03CBh = 971 
R6 = 0258h = 600 
LUF LV UF N Z V C 

After Instruction: 

R4 = 03CBh = 971 
R6 = 0172h = 370 
LUF LV UF N Z V 



1 




C = 0000000 
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SUBRF 



Subtract Reverse Floating-Point 



Syntax 

Operation 

Operands 



SUBRF <src>,<dst> 

src - dst -» dst 

src general addressing modes (G): 
register (Rn, ^ n < 7) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, :S n £ 7) 



Encoding 



31 


24 23 






1615 


87 








110 10 


1 
G 


dst 


src 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 




Tlie src operand minus the dst operand is loaded into the dst register. The 
dst and src operands are assumed to be floating-point numbers. 



1 

N 

Z 
V 

c 

UF 
LV 



1 if a negative result is generated, otherwise. 
1 if a zero result is generated, otherwise. 
1 if a floating-point overflow occurs, otherwise. 
Unaffected. 

1 if a floating-point underflow occurs, otherwise. 
1 if a floating-point overflow occurs, unchanged otherwise. 
LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

OVM Operation not affected by OVM. 

SUBRF @9905h,R5 
Before Instruction: 

DP = 80h 

R5 = 057B400000h = 6.281 250e+01 

Data at 809905h = 733C000h = 1.79750e+02 

LUFLVUFNZVC = 0000000 

After Instruction: 

DP = 80h 

R5 = 0669E00000h = 1.16937500e+02 
Data at 809905h = 733C000h = 1 .79750e+02 
LUFLVUFNZVC = 0000000 
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Subtract Reverse Integer 



SUBRI 



Syntax 


SUBRI <src>,<dst> 






Operation 


src - dst -> dst 






Operands 


src general addressing modes (G): 
register (Rn, ^ n ^ 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, £ n £ 27) 






Encoding 








31 


2423 1615 


87 








1 
1 1 


11 


1 
G 


1 1 1 1 
dst 


1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 

src 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



The src operand minus the dst operand is loaded into the dst register. The 
dst and src operands are assumed to be signed integers. 

1 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

UF 

LV 1 if an integer overflow occurs, unchanged otherwise. 

LUF Unaffected. 

OVM Operation affected by OVM. 

SUBRI *AR5++(IR0) ,R3 
Before Instruction: 

AR5 = 809900h 

IRO = 8h 

R3 = ODCh = 220 

Data at 809900h = 226h = 550 

LUFLVUFNZVC = OO0OOOO 

After Instruction: 

AR5 = 809908h 

IRO = 8h 

R3 = 014Ah = 330 

Data at 809900h = 226h = 550 

LUF LVUFNZVC = 0000000 
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SWI 



Software Interrupt 



Syntax 
Operation 
Operands 
Encoding 



SWI 

Performs an emulation interrupt. 

None 



31 


2423 1615 87 


1 1 1 1 1 1 
110 11 


0000000000000000000000000 



Description The SWI instruction performs an emulator interrupt. This is a reserved in- 
struction and should not be used in normal programming. 

Cycles 4 



Status Bits 


N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 




UF 


Unaffected. 




LV 


Unaffected. 




LUF 


Unaffected. 


Mode Bit 


OVM 


Operation n 
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Trap Conditionally 



TRAPcond 



Syntax 
Operation 



Operands 
Encoding 



TRAPcond N 

-* ST(GIE) 
If cond is true: 

Next PC -♦ * + + SP 

Trap vector N -♦ PC 
Else: 

Set ST(GIE) to original state 

Continue. 

N (0 < N ^ 31) 



31 


24 23 




1615 87 







1110 10 





cond 


00000000001 


N 



Description 



Cycles 



Interrupts are disabled globally when is written to ST(GIE). If the con- 
dition is true, the contents of the PC are pushed on the system stack and 
the PC is loaded with the contents of the specified trap vector (N). If the 
condition is not true, ST(GIE) is set to its value before the TRAPcond in- 
struction changed it. 

The TMS320C30 provides 20 condition codes that can be used with this 
instruction (see Section 9.1 for a list of condition mnemonics, encoding, 
and flags). 



Status Bits N 

Z 
V 

c 

UF 
LV 
LUF 



Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 
Unaffected. 



Mode Bit OVM Operation not affected by OVM. 

Example TRAPZ 16 



Before Instruction: 



PC = 


123h 






SP = 


809870h 






ST = 


Oh 






Trap Vector 1 6 = 


= lOh 




LUF 


LV UF N 


Z V 


C 


Afte 


r Instruction: 










PC = lOh 

SP = 809871 h 

Data at 809871 h = 

ST = Oh 

LUF LV UF N Z 



124h 



VC = 0000000 
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TSTB 



Test Bit Fields 



Syntax 

Operation 

Operands 



TSTB <src>,<dst> 
t/sf AND src 

src general addressing modes (G): 

register (Rn, ^ n ^ 27) 

1 direct 

1 indirect 
1 1 immediate 

dst register (Rn, ^ n ^ 27) 



Encoding 



31 


24 23 






1615 


87 





1 1 



1 1 I 1 1 
110 10 


1 

G 


1 1 1 1 
dst 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
src 



Description The bitwise logical-AND of the dst and src operands is formed, but the re- 
sult is not loaded in any register. This allows for nondestructive compares. 
The dst and src operands are assumed to be unsigned integers. 

Cycles 1 



Status Bits 


N 


MSB of the output. 




Z 


1 if a zero output is generated, otherwise 




V 







c 


Unaffected. 




UF 







LV 


Unaffected. 




LUF 


Unaffected. 


l\/lode Bit 


OVM 


Operation not affected by OVM. 




Example TSTB *-AR4 ( 1 ) , R5 

Before Instruction: 

AR4 = 8099C5h 

R5 = 898h = 2200 

Data at 8099C4h = 767h = 1895 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR4 = 8099C5h 

R5 = 898h = 2200 

Data at 8099C4h = 767h = 1 895 

LUF LV UF N Z V C = 1 



11-178 



Test Bit Fields, 3-Operand 



TSTB3 



Syntax TSTB3 <src2>,<src1> 

Operation srcl AND src2 

Operands srcl three-operand addressing modes (T): 

register (Rnl, £ n1 ^ 27) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 

1 register (Rnl, ^ n1 ^ 27) 
11 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, ^ n2 ^ 27) 

1 register (Rn2, ^ n2 ^ 27) 
10 indirect (disp = 0, 1, IRO, IR1) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

Encoding 



31 


24 23 




1615 




87 







1 


1 1 1 1 1 
1111 


1 

T 


1 1 1 1 



1 1 1 1 1 1 1 
srcl 


1 1 1 I 1 1 1 
src2 



Description 



Cycles 
Status Bits 



The bitwise logical-AND between the srd and src2 operands is formed, 
but is not loaded into any register. This allows for nondestructive compares. 
The srcl and src2 operands are assumed to be unsigned integers. Although 
this instruction has only two operands, it is designated as a three operand 
instruction since operands are specified in the three operand format. 



1 

N 

Z 
V 

c 

UF 
LV 



MSB of the output. 

1 if a zero output is generated, otherwise. 



Unaffected. 



Unaffected. 



IVIode Bit 



LUF Unaffected. 

OVM Operation not affected by OVM. 
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TSTB3 



Test Bit Fields, 3-Operand 



Example TSTB3 *AR5 — (IRO) , *+ARO ( 1) 

Before Instruction: 

AR5 = 809885h 

IRO = BOh 

ARO = 80992Ch 

Data at 809885h = 898h = 2200 

Data at 80992Dh = 767h = 1895 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR5 = 809805h 

IRO = 80h 

ARO = 80992Ch 

Data at 809885h = 898h = 2200 

Data at 80992Dh = 767h = 1895 

LUF LV UF N Z V C = 1 

Example TSTB3 R4,*AR6 — (IRO) 

Before Instruction: 

R4 = 0FBC4h 

AR6 = 8099F8h 

IRO = 8h 

Data at 8099F8h = 1568h. 

LUFLVUFNZVC = 0000000 

After Instruction: 

R4 = 0FBC4h 

AR6 = 8099F0h 

IRO = 8h 

Data at 8099 F8h = 1568h 

LUFLVUFNZVC = 0000000 
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Bit\A^ise Exciusive-OR 



XOR 



Syntax 

Operation 

Operands 



XOR <src>,<dst> 

dst XOR src -* dst 

src general addressing modes (G): 
register (Rn, ^ n £ 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, ^ n ^ 27) 



Encoding 



31 


24 23 






1615 


87 





1 1 



1 1 1 1 1 
110 10 1 


1 

G 


1 1 1 1 
dst 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
src 



Description 

Cycles 
Status Bits 



/VIode Bit 
Example 



The bitwise exclusive-OR of Xhe src and dst operands is loaded into the 
dst register. The dst and src operands are assumed to be unsigned integers. 

1 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 

C Unaffected. 

UF 

LV Unaffected. 

LUF Unaffected. 

OVM Operation not affected by OVM. 

XOR R1,R2 

Before Instruction: 

R1 = 0FFA32h 
R2 - 0FF5C1h 
LUFLVUFNZVC = 0000000 

After Instruction: 

R1 = 0FF3A2h 
R2 = 0OOFF3h 
LUFLVUFNZVC = 0000000 
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XOR3 



Bitwise Exclusive-OR, 3-Operand 



Syntax 

Operation 

Operands 



Encoding 
31 



X0R3 <src2> ,<src1 > ,<dst> 

srcl XOR src2 -* dst 

srcl three-operand addressing modes (T): 
register (Rnl, ^ n1 ^ 27) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 

1 register (Rnl, ^ n1 :S 27) 

1 1 indirect (disp = 0, 1 , IRO, IR1 ) 

src2 three-operand addressing modes (T): 
register (Rn2, ^ n2 ^ 27) 

1 register (Rn2, ^ n2 rs 27) 

1 Indirect (disp = 0, 1 , IRO, IR1 ) 
11 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, ^ n ^ 27) 



24 23 

T 1 1 1 1 1 1 1 

10 10 



1615 



87 



-i — r 
dst 



T — r— 
src2 



srcl 



Description 



Cycles 
Status Bits 



The bitwise exclusive-OR between the srcl and src2 operands is loaded 
into the dst register. The src7, src2, and dst operands are assumed to be 
unsigned integers. 



1 

N 

Z 
V 

c 

UF 
LV 



MSB of the output. 

1 if a zero output is generated, otherwise. 



Unaffected. 



Unaffected. 



Mode Bit 



LUF Unaffected. 

OVM Operation not affected by OVM. 
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Bit\A/ise Exclusive-OR, 3-Operand 



X0R3 



Example 



Example 



X0R3 *AR3++ ( IRO ) , R7 , R4 
Before Instruction: 

AR3 = 809800h 

IRO = lOh 

R7 = OFFFFh 

R4 = Oh 

Data at 809800h = 5AC3h 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR3 = 80980Fh 

IRO = lOh 

R7 = OFFFFh 

R4 = 0A53Ch 

Data at 809800h = 5AC3h 

LUF LVUFNZVC = 0000000 

X0R3 R5,*-AR1(1) ,R1 
Before Instruction: 

R5 = 0FFA32h 

AR1 = 809826h 

R1 = Oh 

Data at 809825h = 0FF5C1h 

LUFLVUFNZVC = 0000000 



After Instruction: 

R5 = 0FFA32h 

AR1 = 809826h 

R1 = 000F33h 

Data at 809825h = 0FF5C1h 

LUFLVUFNZVC = 0000000 
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XOR3IISTI 



Parallel X0R3 and STI 



Syntax 



Operation 



Operands 



Encoding 
31 



X0R3 <src2><src1>,<dst1> 
STI <src3> ,<dst2> 



srcl XOR src2 
src3 -* dst2 



dsti 



srcl register (Rnl, ^ n1 ^7) 

src2 indirect (disp = 0, 1 , IRO, IR1 ) 

dsti register (Rn2, ^ n2 < 7) 

src3 register (Rn3, £ n3 < 7) 

dst2 indirectO(disp = 0, 1, IRO, 1R1) 



1 1 1 1— T 1 

1110 111 



24 23 

— ; — r 
dsti 



1615 



87 



—\ — r- 
src3 



T 1 1 

src2 



srcl 



dst2 



Description 



Cycles 
Status Bits 



l\/lode Bit 



A bitwise exclusive-XOR and an integer store are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STI) reads from a 
register and the operation being performed in parallel (X0R3) writes to the 
same register, then STI accepts as input the contents of the register before 
it is modified by the X0R3. 

If src2 and dst2 point to the same location, src2 is read before the write to 
dst2. 

1 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 

C Unaffected. 

UF 

LV Unaffected. 

LUF Unaffected. 



OVM Operation not affected by OVM. 
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Parallel X0R3 and STI X0R3||STI 



Example X0R3 *AR1++,R3,R3 

I I STI R6,*-AR2(IR0) 



Before Instruction: 

AR1 = 80987Eh 

R3 = 85h 

R6 = ODCh = 220 

AR2 = 8098B4h 

IRO = 8h 

Data at 80987Eh = 85h 

Data at 8098ACh = Oh 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR1 = 80987Fh 

R3 = Oh 

R6 = ODCh = 220 

AR2 = 8098B4h 

IRO = 8h 

Data at 80987Eh = 85h 

Data at 8098ACh = ODCh = 220 

LUF LVUFNZVC = 0000000 
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Assembly Language Instructions - Individual Instructions 
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Introduction 



■k"m&il 



■m 



■jKtf^fi^tif^ff^ 



Pinout and Signal Descriptions 



??i'3f^^iS 
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CPU Registers, JVlemory, and Cache 



Data Formats and Floating-Point Operation 
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^ii 
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EKternal Bus Operation 'l^^mB^ 
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Pipeline Operation 






Assembly Language Instructions 



Software Applications 



Hardware Applications 



Appendices 




Section 12 

Software Applicatioins 



'■■"""■'■'■■«''""''"' ■■' "■""■" 



The TMS320C30 is a very powerful digital signal processor with architecture 
and instruction set designed to make easy system solutions to DSP problems. 
There are instructions specifically designed for efficient implementations of 
DSP algorithms, but also there are general-purpose instructions that make the 
device suitable for more general tasks, like any microprocessor. The floating 
point and integer arithmetic supported by the device permits the designer to 
concentrate on the algorithm with minimal concerns about scaling, dynamic 
range, and overflows. 

The purpose of this section is to explain how to use the instruction set, the 
architecture, and the interface of the TMS320C30 processor. This is done by 
presenting coding examples for very frequently used applications, and by dis- 
cussing more involved examples and applications. In all cases, besides ex- 
plaining the principles involved in the application, the corresponding 
assembly-language code is given for instructional purposes and for immediate 
use. Whenever the detailed explanation of the underlying theory is too ex- 
tensive to be included in this manual, appropriate references are given for 
further information. 

Major topics discussed in this section are listed below. 

O Processor Initialization (Section 12.1 on page 12-3) 

O Program Control (Section 12.2 on page 12-7) 

- Subroutine calls 

- Software stack 

- Interrupt handling 

- Delayed branches 

- Repeat modes 
Computed GOTO's 

O Logical and Arithmetic Operations (Section 12.3 on page 12-20) 

- Bit manipulation 

- Block moves 

- Bit-reversed addressing 

- Division 

- Square root 

- Extended -precision arithmetic 

- IEEE < = = > C30 floating point conversions 

O Application-oriented Operations (Section 12.4 on page 12-45) 

- Companding (A-law, M-law) 

FIR / IIR filters (fixed and adaptive) 

- Matrix math 
FFT 
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Software Applications 



- Lattice filters 

Programming Tips (Section 12.5 on page 12-87) 

- C-callabie Routines 

- Code Optimization Check-list 
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Software Applications - Processor Initialization 



12.1 Processor Initialization 



Prior to the execution of a digital signal processing algorithm, it is necessary 
to initialize the processor. Generally, initialization takes place any time the 
processor is reset. 



When reset is activated by applying a low level to the RESET input for several 
cycles, the TI\/IS320C30 terminates execution and puts the reset vector (i.e., 
the contents of memory location 0) in the program counter. The reset vector 
normally contains the address of the system initialization routine. The hard- 
ware reset also initializes various registers and status bits. 

After reset, the processor should be initialized to meet the requirements of the 
system. Instructions should be executed that set up operational modes, me- 
mory pointers, interrupts, and the remaining functions needed to meet system 
requirements. 

To configure the processor at reset, the following internal functions should 
be initialized: 

© Memory-mapped registers 
O Interrupt structure 

Example 12-1 shows coding for initializing the TMS320C30 to the following 
machine state, in addition to the initialization performed during the hardware 
reset (for conditions after hardware reset, see section 13): 

O All interrupts are enabled. 

O The overflow mode is disabled. 

O The data memory page pointer is set to zero. 

O The internal memory is filled with zeros. 

Note that all constants larger than 16 bits should be placed in memory and 
accessed through direct or indirect addressing. 
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Example 12-1. TI\/IS320C30 Processor Initialization 



TITL 'PROCESSOR INITIALIZATION EXAMPLE' 

•global RESET, INIT, BEGIN 

.global INT0,INT1,INT2,INT3 

.global ISR0,ISR1,ISR2,ISR3 

.global DINT, DMA 

•global TINT0,TINT1,XINT0,RINT0,XINT1,RINT1 

. global TIMED , TIMEl , XMTO , RCVO , XMTl , RCVl 

. global TRAPO , TRAPl , TRAP2 , TRPO , TRP 1 , TRP2 

PROCESSOR INITIALIZATION FOR THE TMS320C30. 

RESET AND INTERRUPT VECTOR SPECIFICATION. THIS 
ARRANGEMENT ASSUMES THAT DURING LINKING, THE FOLLOWING 
TEXT SEGMENT WILL BE PLACED TO START AT MEMORY 
LOCATION 0. 



RESET 
* 

INTO 
INTl 
INT2 
INT3 



sect 


"init 


word 


INIT 


word 


ISRO 


word 


ISRl 


word 


ISR2 


word 


ISR3 



Named section 

RS- loads address INIT to PC 

INTO- loads address INTO to PC 
INTl- loads address INTl to PC 
INT2- loads address INT2 to PC 
INT3- loads address INT3 to PC 



XINTO 
RINTO 
XINTO 
RINTO 
TINTO 
TINTl 
DINT 

TRAPO 
TRAPl 
TRAP 2 



.word 

.word 

.word 

.word 

.word 

.word 

.word 

. space 

.word 

.word 

.word 

. space 



XMTO 

RCVO 

XMTl 

RCVl 

TIMEO 

TIMEl 

DMA 

20 

TRPO 

TRPl 

TRP 

29 



Serial port transmit processing 
Serial port receive processing 
Serial port 1 transmit processing 
Serial port 1 receive processing 
Timer interrupt processing 
Timer 1 interrupt processing 
DMA interrupt 
Reserved space 

Trap vector processing begins 
Trap 1 vector processing begins 
Trap 2 vector processing begins 
Leave space for the other 29 traps 




* IN THIS 

* IN THE 

MASK 

BLKO 

BLKO 

STCK 

CTRL 

DMACTL 

TIMOCTL 

TIMICTL 

SERGLOBO 

SERPRTXO 

SERPRTRO 

SERTIMO 

SERGLOBl 

SERPRTXl 

SERPRTRl 

SERTIMl 

PARINT 

lOINT 



SECTION, CONSTANTS THAT CANNOT BE REPRESENTED 
SHORT FORMAT ARE INITIAlIZED^ 
.data 



.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 



OFFFFFFFFH 

0809800H 

0809C00H 

0809F00H 

0808000H 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 

OOOOOOOH 



Initialization 
Initialization 
Init of serial 



Beginning address of RAM block 
Beginning address of RAM block 1 
Beginning of stack 

Pointer for peripheral-bus memory map 
Initialization for DMA control (0) 

of timer control (32) 
of timer 1 control (48) 
glbl control (64) 
xmt port control (66) 
rev port control (67) 

timer control (58) 

1 glbl control (80) 
1 xmt port control (82) 
1 rev port control (83) 
1 timer control (84) 

Init parallel interface control (95) 
Init I/O interface control (100) 



Init 
Init 
Init 
Init 
Init 
Init 
Init 



of 
of 
of 
of 
of 
of 
of 



serial 
serial 
serial 
serial 
serial 
serial 
serial 
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.text 

THE ADDRESS AT MEMORY LOCATION DIRECTS EXECUTION TO BEGIN HERE 
FOR RESET PROCESSING THAT INITIALIZES THE PROCESSOR. WHEN RESET 
IS APPLIED, THE FOLLOWING REGISTERS ARE INITIALIZED TO ZERO: 

ST CPU STATUS REGISTER 

IE CPU/DMA INTERRUPT ENABLE FLAGS 

IF CPU INTERRUPT FLAGS 

I OF I/O FLAGS 



THE STATUS REGISTER HAS THE FOLLOWING ARRANGEMENT; 



BITS: 
FUNCTION : 



31-14 
RESRV 



13 
GIE 



12 
CC 



11 10 9 87 6 543 
CE CF RES RM OVM LUF LV UF N 



INIT 



LDP 
LDI 
LDI 



0,DP 

1800H,ST 

@MASK,IE 



Point the DP register to page 

Clear and enable cache, and disable OVM 

Unmask all interrupts 



LDI 


@BLKO,ARO 


LDI 


@BLK1,AR1 


LDF 


0.0, RO 


RPTS 


1023 


STF 


R0,*AR0++(1) 


STF 


R0,*AR1++(1) 



INTERNAL DATA MEMORY INITIALIZATION TO FLOATING POINT ZERO 

ARO points to block 

ARl points to block 1 

Zero register RO 

Repeat 1024 times , . . 

Zero out location in RAM block and 

II STF R0,*AR1++(1) ; zero out location in RAM block 

* 

* THE PROCESSOR IS INITIALIZED. THE REMAINING APPLICATION- 

* DEPENDENT PART OF THE SYSTEM (BOTH ON- AND OFF-CHIP SHOULD 

* NOW BE INITIALIZED. 
* 

* FIRST, INITIALIZE THE CONTROL REGISTERS. IN THIS EXAMPLE, 

* EVERYTHING IS INITIALIZED TO ZERO SINCE THE ACTUAL INITIAL- 

* IZATION IS APPLICATION DEPENDENT. 



LDI 


(aCTRL,ARO 


LDI 


@DMACTL,RO 


STI 


RO,*+ARO(0) 


LDI 


@TIMOCTL,RO 


STI 


R0,*+AR0(32) 


LDI 


@TIM1CTL,R0 


STI 


R0,*+AR0(48) 


LDI 


@SERGLOB0,R0 


STI 


R0,*+AR0(64) 


LDI 


§SERPRTX0,R0 


STI 


R0,*+AR0(66) 


LDI 


@SERPRTRO,RO 


STI 


R0,*+AR0(67) 


LDI 


@SERTIM0,R0 


STI 


R0,*+AR0(68) 


LDI 


@SERGLOBL,R0 


STI 


R0,*+AR0(80) 


LDI 


§SERPRTX1,R0 


STI 


R0,*+AR0(82) 


LDI 


(aSERPRTRl,R0 


STI 


R0,*+AR0(83) 



LOAD in ARO the pointer to control 
registers 

Init DMA control 

Init timer control 

Init timer 1 control 

Init serial global control 

Init serial xmt control 

Init serial rev control 

Init serial timer control 



Init serial 1 



Init serial 1 



global control 
xmt control 




Init serial 1 rev control 
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LDI 
STI 
LDI 
STI 
LDI 
STI 

LDI 
OR 

BR 

. end 



@SERTIM1,R0 

R0,*+AR0(84) 

@PARINT,RO 

R0,*+AR0(96) 

@IOINT,R0 

R0,*+AR0(100) 

§STCK,SP 
2000H,ST 

BEGIN 



Init serial timer control 

Init parallel interface control 

Init I/O interface control 

Initialize the stack pointer 
Global interrupt enable 

Branch to the beginning of application. 
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12.2 Program Control 



To facilitate the TMS320C30's use in general-purpose, high-speed process- 
ing, a variety of instructions are provided to handle the: 

O subroutine calls 

9 software stack 

• interrupts 

© zero-overhead branches 

© single- and multiple-instruction loops without any overhead. 

This section describes how to use these features of the TMS320C30. 



12.2.1 Subroutines 



The TMS320C30 has a 24-bit program counter (PC) and a practically unlim- 
ited software stack. The CALL and CALLcond subroutine calls cause the stack 
pointer to increment, and store the contents of the next value of the PC 
counter on the stack. At the end of the subroutine, RETScond performs a 
conditional return. 

Example 12-2 illustrates the use of a subroutine to determine the dot product 
between two vectors. Given two vectors of length N, represented by the ar- 
rays a[0], a[1],..., a[N-1] and b[0], b[1] b[N-1], the dot product is com- 
puted from the expression 

d = a[0] b[0] + a[1] b[1] + ... + a[N-1] b[N-1] 

Processing proceeds in the main routine to the point where the dot product 
is to be computed. It is assumed that the arguments of the subroutine have 
been appropriately initialized. At this point, a CALL is made to the subroutine, 
transferring control to that section of the program memory for execution, then 
returning to the calling routine via the RETS instruction when execution has 
completed. Note for this particular example, it would suffice to save the reg- 
ister R2. However, a larger number of registers are saved for demonstration 
purposes. The saved registers are stored on the system stack. This stack 
should be large enough to accommodate the maximum anticipated storage 
requirements. Besides this way of saving registers, any other method could 
be used equally well. 




12-7 



Software Applications - Program Control 



Example 12-2. Subroutine Call (Dot Product) 

* 

* TITL SUBROUTINE CALL (DOT PRODUCT) 



MAIN ROUTINE THAT CALLS THE SUBROUTINE 'DOT' TO COMPUTE THE 
DOT PRODUCT OF TWO VECTORS. 



LDI 
LDI 
LDI 

CALL 



@blkO,ARO 
@blkl,ARl 
N,RC 

DOT 



ARO points to vector a 
ARl points to vector b 
RC contains the number of elements 



SUBROUTINE DOT 



EQUATION: d = a(0) * b(0) + a{l) * b(l) + 



+ a(N-l) * b(N-l) 



THE DOT PRODUCT OF a AND b IS PLACED IN REGISTER RO , 
BE GREATER THAN OR EQUAL TO 2, 

ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 

+ 

ARO I ADDRESS OF a(0) 
ARl I ADDRESS OF b(0) 
RC I LENGTH OF VECTORS (N) 



N MUST 



REGISTERS USED AS INPUT 
REGISTER MODIFIED: RO 
REGISTER CONTAINING RESULT 



ARO, ARl, RC 
RO 



DOT 




.global 


DOT 


PUSHF 


R2 


PUSH 


R2 


PUSH 


ST 


PUSH 


ARO 


PUSH 


ARl 


PUSH 


RC 


MPYF3 


*AR0,*AR1,R0 


LDF 


0.0, R2 


SUBI 


2,RC 



Use the stack to save R2'S 

bottom 32 and top 32 bits 
Save status register 
Save ARO 
Save ARl 
Save RC 
Initialize RO: 
a(0) * b(0) -> RO 
Initialize R2 . 
Set RC = N-2 
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* DOT PRODUCT (1 <= i < N) 



I I 



RPTS RC ; Setup the repeat single. 

MPYF3 *++ARO(l) ,*++ARl(l) ,R0 ; a( i) * b ( i) -> RO 
ADDF3 R0,R2,R2 ; a( i-1) *b ( i-1) + R2 -> R2 

a(N-l)*b(N-l) + R2 -> RO 



Restore RC 

Restore ARl 

Restore ARO 

Restore ST 

Restore top 32 bits of R2 

Restore bottom 32 bits of R2 

Return 



ADDF3 


RO , R2 , RO 


RETURN SEQUENCE 


POP 


RC ; 


POP 


ARl ; 


POP 


ARO ; 


POP 


ST ; 


POPF 


R2 ; 


POP 


R2 


RETS 




end 




end 





12.2.2 Software Stack 



The TMS320C30 has a software stack whose location is determined by the 
contents of the stack pointer register SP. The stack pointer increments from 
low to high values, and provisions should be made to accommodate the an- 
ticipated storage requirements. The stack can be used not only during the 
subroutine CALL and RETS, but also inside the subroutine as a place of tem- 
porary storage of the registers as shown in Example 12-2. SP always points 
to the last value pushed on the stack. 

The CALL and CALLcond instructions push the value of the program counter 
on the stack, as do the interrupt routines. Then, RETScond and RETIcond pop 
the stack and place the value in the program counter. The integer value of any 
register can be pushed on and popped off the stack using the PUSH and POP 
instructions. There are two additional instructions, PUSHF and POPF, for 
floating point numbers. These instructions can be used to pop and push 
floating point numbers to registers R0-R7. This feature is very useful if it is 
desired to save the extended precision registers (see Example 12-2). By using 
PUSH and PUSHF on the same register, the lower 32 and the upper 32 bits 
are saved. PUSH saves the lower 32; PUSHF, the upper 32. To recover this 
extended precision number, a POPF can be done followed by POP. It is im- 
portant to do the integer and floating-point PUSH and POP in the above or- 
der. POPF forces the last eight bits of the extended-precision registers to zero. 

The stack pointer (SP) can be both read from, and written to. Multiple stacks 
for different program segments may be easily created. SP is not initialized by 
the hardware during reset. It is therefore important to remember to initialize 
its value so that SP points to a predetermined memory location. This avoids 
the problem of SP attempting to write into ROM or over other useful data. 
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12.2.3 Interrupt Service Routines 

Interrupts on the TMS320C30 are prioritized and vectored. When an interrupt 
occurs, the corresponding flag is set in the Interrupt Flag Register IF. If the 
corresponding bit in the Interrupt Enable Register IE is set, and interrupts are 
enabled by having the GIE bit in the status register set to 1, interrupt proc- 
essing begins. The interrupt flag register can also be written to. This enables 
the user to force an interrupt by software, or to clear interrupts without proc- 
essing them. 

The Interrupt Flag Register IF can be read, and action taken based on whether 
the interrupt has occurred. This is true even when the interrupt is disabled. 
This can be useful when an interrupt-driven interface is not implemented. Ex- 
ample 12-3 shows the case where a subroutine is called when interrupt 1 has 
not occurred. 

Example 12-3. Use of Interrupts for Software Polling 

* TITL INTERRUPT POLLING 



TSTB 2, IF ; Test if interrupt 1 has occurred 

CALLZ SUBROUTINE ; If not, call subroutine 



When interrupt processing begins, the program counter is pushed on the 
stack, and the interrupt vector is loaded in the program counter. Interrupts are 
then disabled by setting the GIE=0, and the program continues from the ad- 
dress loaded in the program counter. Since all interrupts are disabled, inter- 
rupt processing may proceed without further interruption, unless the interrupt 
service routine re-enables interrupts. 

Except for very simple interrupt service routines, it is important to assure that 
the processor context is saved during execution of this routine. The context 
must be saved before executing the routine itself, and restored after the routine 
is finished. The procedure is called context switching. Context switching is 
also useful for subroutine calls, especially when extensive use is made of the 
auxiliary and the extended precision registers. Code examples of context 
switching and an interrupt service routine are provided in this section. 
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12.2.3. 1 Context Switching 



Context switching is commonly required when processing a subroutine call 
or interrupt. It may be quite extensive or simple, depending on system re- 
quirements. On the TMS320C30, the program counter is automatically 
pushed on the stack. If there is any important information in the other 
TMS320C30 registers, such as the status, auxiliary or extended-precision re- 
gisters, these must be saved by special commands. 

Examples 12-4 and 12-5 show saving and restoring of theTMS320C30 state. 
In both examples, the stack is used for saving the registers, and it expands 
towards higher addresses. If it is not desirable to use the stack pointed at by 
SP, a separate stack can be created using an auxiliary register as the stack 
pointer. The registers saved are: 

O Extended -precision registers RO through R7 

O Auxiliary registers ARO through AR7 

O Data page pointer DP 

O Index registers IRO and IR1 

O Block size register BK 

• Status register ST 

O Interrupt-related registers IE and IF 

• I/O flag lOF 

© Repeat- related registers RS, RE, and RC 
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Example 12-4. Context Save For The TMS320C30 

* TITL CONTEXT-SAVE FOR THE TMS320C30 
* 

* 

.global SAVE 
* 

* CONTEXT SAVE ON SUBROUTINE CALL OR INTERRUPT. 



SAVE: 



SAVE THE EXTENDED PRECISION REGISTERS 



PUSH 


RO 


PUSHF 


RO 


PUSH 


Rl 


PUSHF 


Rl 


PUSH 


R2 


PUSHF 


R2 


PUSH 


R3 


PUSHF 


R3 


PUSH 


R4 


PUSHF 


R4 


PUSH 


R5 


PUSHF 


R5 


PUSH 


R6 


PUSHF 


R6 


PUSH 


R7 


PUSHF 


R7 



Save the lower 32 

and the upper 
Save the lower 32 

and the upper 
Save the lower 32 

and the upper 
Save the lower 32 

and the upper 
Save the lower 32 

and the upper 
Save the lower 32 

and the upper 
Save the lower 32 

and the upper 
Save the lower 32 

and the upper 



bits of RO 
32 bits 
bits of Rl 
32 bits 
bits of R2 
32 bits 
bits of R3 
32 bits 
bits of R4 
32 bits 
bits of R5 
32 bits 
bits of R6 
32 bits 
bits of R7 
32 bits 



SAVE THE AUXILIARY REGISTERS 



PUSH 


ARC 


PUSH 


ARl 


PUSH 


AR2 


PUSH 


AR3 


PUSH 


AR4 


PUSH 


AR5 


PUSH 


AR6 


PUSH 


AR7 



Save 


ARO 


Save 


ARl 


Save 


AR2 


Save 


AR3 


Save 


AR4 


Save 


AR5 


Save 


AR6 


Save 


AR7 



SAVE THE REST REGISTERS FROM THE REGISTER FILE 




PUSH 


DP 


PUSH 


IRQ 


PUSH 


IRl 


PUSH 


BK 


PUSH 


ST 


PUSH 


IE 


PUSH 


IF 


PUSH 


lOF 


PUSH 


RS 


PUSH 


RE 


PUSH 


RC 



Save data page pointer 

Save index register IRO 

Save index register IRl 

Save block-size register 

Save status register 

Save interrupt enable register 

Save interrupt flag register 

Save I/O flag register 

Save repeat start address 

Save repeat end address 

Save repeat counter 



SAVE IS COMPLETE 
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Example 12-5. Context- Restore For The TMS320C30 

* 

* TITL CONTEXT-RESTORE FOR THE TMS320C30 
* 

* 

.GLOBAL RESTR 

* 

* CONTEXT RESTORE AT THE END OF A SUBROUTINE CALL OR INTERRUPT. 



RESTR: 



RESTORE THE REST REGISTERS FROM THE REGISTER FILE 



POP 


RC 


POP 


RE 


POP 


RS 


POP 


lOF 


POP 


IF 


POP 


IE 


POP 


ST 


POP 


BK 


POP 


IRl 


POP 


IRQ 


POP 


DP 



Restore 
Restore 
Restore 
Restore 
Restore 
Restore 
Restore 
Restore 
Restore 
Restore 
Restore 



* RESTORE THE AUXILIARY REGISTERS 



POP 
POP 
POP 
POP 
POP 
POP 
POP 
POP 



AR7 
AR6 
AR5 
AR4 
AR3 
AR2 
ARl 
ARO 



Restore 
Restore 
Restore 
Restore 
Restore 
Restore 
Restore 
Restore 



repeat counter 
repeat end address 
repeat start address 
I/O flag register 
interrupt flag register 
interrupt enable register 
status register 
block-size register 
index register IRl 
index register IRQ 
data page pointer 



AR7 
AR6 
AR5 
AR4 
AR3 
AR2 
ARl 
ARO 



RESTORE THE EXTENDED PRECISION REGISTERS 



POPF 


R7 


POP 


R7 


POPF 


R6 


POP 


R6 


POPF 


R5 


POP 


R5 


POPF 


R4 


POP 


R4 


POPF 


R3 


POP 


R3 


POPF 


R2 


POP 


R2 


POPF 


Rl 


POP 


Rl 


POPF 


RO 


POP 


RO 



Restore the upper 32 bits and 

the lower 32 bits of R7 
Restore the upper 32 bits and 

the lower 32 bits of R6 
Restore the upper 32 bits and 

the lower 32 bits of R5 
Restore the upper 32 bits and 

the lower 32 bits of R4 
Restore the upper 32 bits and 

the lower 32 bits of R3 
Restore the upper 32 bits and 

the lower 32 bits of R2 
Restore the upper 32 bits and 

the lower 32 bits of Rl 
Restore the upper 32 bits and 

The lower 32 bits of RO 




RESTORE IS COMPLETE 
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12.2.3.2 Interrupt Priority 

Interrupts on the TMS320C30 are automatically prioritized. This allows in- 
terrupts that occur simultaneously to be serviced in a predefined order. Infre- 
quent, but lengthy, interrupt service routines may need to be interrupted by 
more frequently occurring interrupts. In Example 12-6, the interrupt service 
routine for INT2 temporarily modifies the interrupt enable register IE, to permit 
interrupt processing when an interrupt to INTO (but no other interrupt) occurs. 
When the routine has finished processing, the register IE is restored to its ori- 
ginal state. Notice that the RETI instruction not only pops the next program 
counter address from the stack, but also sets the GIE bit of the status register. 
This enables all interrupts which have their interrupt-enable bit set. 

Example 12-6. Interrupt Service Routine 

* TITL INTERRUPT SERVICE ROUTINE 

* .global ISR2 

ENABLE .set 2000h 

MASK .set 1 
* 

* INTERRUPT PROCESSING FOR EXTERNAL INTERRUPT INT2- 



ISR2: 



PUSH 


ST 


PUSH 


DP 


PUSH 


IE 


PUSH 


RO 


PUSHF 


RO 


PUSH 


Rl 


PUSHF 


Rl 


LDI 


MASK, IE 


OR 


ENABLE, ST 



Save status register 
Save data page pointer 
Save interrupt enable register 
Save lower 32 bits and 
upper 32 bits of RO 
Save lower 32 bits and 
upper 32 bits of Rl 
Unmask only INTO 
Enable all interrupts 



MAIN PROCESSING SECTION FOR ISR2 




XOR 


ENABLE , ST 


POPF 


Rl 


POP 


Rl 


POPF 


RO 


POP 


RO 


POP 


IE 


POP 


DP 


POP 


ST 



RETI 



Disable all interrupts 
Restore upper 32 bits and 

lower 32 bits of Rl 
Restore upper 32 bits and 

lower 32 bits of RO 
Restore interrupt enable register 
Restore data page register 
Restore status register 

Return and enable interrupts 
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12.2.4 Delayed branches 

The TMS320C30 offers the capability of single-cycle branching through the 
use of the delayed branches. The delay branches operate like regular branches 
but do not flush the pipeline. Instead, the three instructions following a de- 
layed branch are also executed. As discussed in the section on program-flow 
control, the only limitation is that the three instructions following a delayed 
branch cannot be a: 

O Branch (standard or delayed) 

O Call to a subroutine 

O Return from a subroutine 

O Return from an interrupt 

o Repeat instructions 

O A TRAP instruction 

O An IDLE instruction 

Conditional delayed branches use the conditions that exist at the end of the 
instruction immediately preceding the delayed branch. Sometimes, a branch 
is necessary in the flow of a program, but less than three instructions can be 
placed after a delayed branch. For faster execution, it is still advantageous to 
use a delayed branch. This is shown in Example 12-7, with NOP's taking the 
place of the unused instructions. The trade-off is more instruction words for 
less execution time. 

Example 12-7. Delayed Branch Execution 

* TITL DELAYED BRANCH EXECUTION 



LDF 


*+ARl(5),R2 ; 


BGED 


SKIP ; 


LDFN 


R2,R1 ; 


SUBF 


3.0,R1 ; 


NOP 




MPYF 


1.5,R1 ; 



Load contents of memory to R2 

If loaded number >=0, branch (delayed) 

If loaded number <0, load it to Rl 

Subtract 3 from Rl 

Dummy operation to complete delayed 

branch 

Continue here if loaded number <0 




SKIP 



LDF 



R1,R3 



Continue here if loaded number >=0 
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12.2.5 Repeat Modes 




The TMS320C30 supports looping without any overhead. For that purpose, 
there are two instructions: RPTB repeats a block of code, and RPTS repeats a 
single instruction. There are three control registers RS (repeat start address), 
RE (repeat end address), and RC (repeat counter). These contain the param- 
eters that specify loop execution (refer to Section 7.1 for a complete de- 
scription of RPTB and RPTS). RS and RE are automatically set from the code, 
while RC has to be set by the user, as shown in the examples below. 

72.2.5. 1 Block Repeat 

Example 1 2-8 shows an application of the block repeat construct. In this ex- 
ample, an array of 64 elements is "flipped over" by exchanging the elements 
that are equidistant from the end of the array. In other words, if the original 
array is: 

a(1), a(2) a(31), a(32) a(64); 

the final array after the rearrangement will be: 

a(64), a(63),..., a(32), a(31 ),..., a(1). 

Note that since the exchange operation is done on two elements at the same 
time, there is a need of 32 operations. The repeat counter RC is initialized to 
31. In general, if RC contains the number N, the loop will be executed N+1 
times. The loop is defined by the RPTB instruction and the EXCH label. 

Example 12-8. Loop Using Block Repeat 

* TITL LOOP USING BLOCK REPEAT 
* 

* THIS CODE SEGMENT EXCHANGES THE VALUES OF ARRAY ELEMENTS THAT ARE 

* SYMMETRIC AROUND THE MIDDLE OF THE ARRAY. 



ARO points to the beginning of the array 

ARl points to the end of the 

64-element array 
Initialize repeat counter 

Repeat RC+1 times between here and 

EXCH 
Load one memory element in RO, 

and the other in Rl 
Then, exchange their locations 



Section 7.1.2 specifies restrictions in the block-repeat construct. The basic 
rule is, since the program counter is modified at the end of the loop according 
to the contents of the registers RS, RE, and RC, no operation should attempt 



ie 


LDI 
LDI 
ADD I 


@ADDR,ARO 
ARO, ARl 
63, ARl 




LDI 


31,RC 




RPTB 


EXCH 


II 

EXCH 
M 


LDI 
LDI 
STI 
STI 


*ARO,RO 
*AR1,R1 
R1,*AR0++(1) 
R0,*AR1~(1) 
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to modify the repeat counter or the program counter at the end of the loop in 
a different way. 

In principle, it is possible to nest repeat blocks. However, there is only one 
set of control registers RS, RE, and RC. It is therefore necessary to save these 
registers before entering an inside loop. It may be more economical to im- 
plement a nested loop by the more traditional method of a register serving as 
a counter and then using a delayed branch rather than applying the above 
approach. 

Example 12-9 shows another example of using the block repeat to find a 
maximum of 147 numbers. 

Example 12-9. Use of Block Repeat to Find a Maximum 



TITL USE OF BLOCK REPEAT TO FIND A MAXIMUM 
THIS ROUTINE FINDS THE MAXIMUM OF N=147 NUMBERS 



LOOP 



LDI 
LDI 
LDF 

RPTB 
CMPF 
LDFLT 



146, RC 
@ADDR,ARO 
*AR0++(1) ,R0 

LOOP 

*AR0++(1) ,R0 
*-ARO(l) ,R0 



Initialize repeat counter to 147-1 

ARO points to the beginning of the array 

Initialize MAX to the first value 



Compare number to the maximum 

If greater, this is a new maximum 



12.2.5.2 Single-Instruction Repeat 



The single instruction repeat operates using the control registers RS, RE, and 
RC, in the same way as the block repeat. The advantage over the block repeat 
is that the instruction is fetched only once, and then the buses are available 
for moving operands. One difference to note is that the single-instruction re- 
peat construct is not interruptible, while block repeat is interruptible. 

Example 12-10 shows an application of the repeat-single construct. In this 
example, the sum of the products of two arrays is computed. The arrays are 
not necessarily different. If the arrays are a(i) and b(i), each of length N=51 2, 
register RO will contain, after computation, the quantity: 

a(1)b(1)+a(2)b(2) + ...+a(N)b(N). 

The value of the repeat counter (RC) is specified to be 51 1 in the instruction. 
If RC contains the number N, the loop will be executed N+1 times. 
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Example 12-10. 

* TITL LOOP USING SINGLE REPEAT 



* THIS CODE SEGMENT COMPUTES X a(i)b(i) 

* i=l 



LDI 
LDI 

LDF 

MPYF3 

RPTS 

MPYF3 
ADDF3 

ADDF 



@ADDR1,AR0 ; ARC points to array a(i) 
@ADDR2,AR1 ; ARl points to array b(i) 

0.0, RO ; Initialize RO 

* ARO++ (1 ) , * AR1++ ( 1 ) , Rl 

; Compute first product 
511 ; Repeat 512 times 

*AR0++(1) ,*AR1++(1) ,R1,R0 ; Compute next product 
R1,R0,R0 ; and accumulate the previous one 



R1,R0 



One final addition 



12.2.6 Computed GOTO's 



Occasionally, it is convenient to select during runtime, and not during assem- 
bly, what subroutine needs to be executed. The TMS320C30 offers the ca- 
pability of a computed GOTO that can satisfy such a need. The computed 
GOTO is implemented using the CALLcond instruction in the register ad- 
dressing mode. This instruction uses the contents of the register as the ad- 
dress of the call. Example 1 2-1 1 shows the case of a task controller. 
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Example 12-11. Computed GOTO 

* TITL COMPUTED GOTO 

* 

* TASK CONTROLLER 

* 

* THIS MAIN ROUTINE CONTROLS THE ORDER OF TASK EXECUTION (6 TASKS 

* IN THE PRESENT EXAMPLE). TASKO THROUGH TASK5 ARE THE NAMES OF 

* SUBROUTINES TO BE CALLED. THEY ARE EXECUTED IN ORDER, TASKO, 

* TASKl, . . .TASKS. WHEN AN INTERRUPT OCCURS, THE INTERRUPT 

* SERVICE ROUTINE IS EXECUTED, AND THE PROCESSOR CONTIMUES 

* WITH THE INSTRUCTION FOLLOWING THE IDLE INSTRUCTION. THIS 

* ROUTINE SELECTS THE TASK APPROPRIATE FOR THE CURRENT CYCLE, 

* CALLS THE TASK AS A SUBROUTINE, AND BRANCHES BACK TO THE IDLE 

* TO WAIT FOR THE NEXT SAMPLE INTERRUPT WHEN THE SCHEDULED TASK 

* HAS COMPLETED EXECUTION. RO HOLDS THE OFFSET FROM THE BASE 

* ADDRESS OF THE TASK TO BE EXECUTED. 





LDI 


5,R0 




LDI 


@ADDR,AR1 


WAIT 


IDLE 




•k 


ADD I 


*AR1,R0,R1 




SUBI 


1,R0 




LDILT 


5,R0 




CALLU 


Rl 


ic 


BR 


WAIT 


TSKSEQ 


.word 


TASKS 




.word 


TASK4 




.word 


TASK3 




.word 


TASK2 




-word 


TASKl 




.word 


TASKO 


ADDR 


• WORD 


TSKSEQ 



Initialize RO 

ARl holds the base address of the table 

Wait for the next interrupt 

Add the base address to the table 

Entry number 
Decrement RO 

If R0<0, reinitialize it to 5 
Execute appropriate task 



Address 
Address 
Address 
Address 
Address 
Address 



of TASKS 
of TASK4 
of TASK3 
of TASK2 
of TASKl 
of TASKO 
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12.3 Logical and Arithmetic Operations 

The TMS320C30 instruction set supports both integer and floating point 
arithmetic and logical operations. The basic functions of such instructions can 
be combined to form more complex operations. This section examines exam- 
ples of such operations, such as: 

Bit manipulation 

Block moves 

Bit-reversed addressing 

Integer and floating-point division 

Square root 

O Extended precision arithmetic 

• Floating point format conversion between IEEE and TMS320C30 
formats. 

12.3.1 Bit Manipulation 

The instructions of the TMS320C30 for the usual logical operations, such as 
AND, OR, NOT, ANDN, and XOR, can be used together with the shift in- 
structions for bit manipulation. In addition to these instructions, there is a 
special instruction, TSTB, for testing bits. TSTB does the same operation as 
AND, but the result of the logical AND is not written anywhere and is only 
used to set the condition flags. Examples 12-12 and 12-13 demonstrate the 
use of the several instructions for bit manipulation and testing. 

Example 12-12. Use of TSTB for Software-Controlled Interrupt 

* TITL USE OF TSTB FOR SOFTWARE-CONTROLLED INTERRUPT 



IN THIS EXAMPLE, ALL INTERRUPTS HAVE BEEN DISABLED BY 
RESETTING THE GIE BIT OF THE STATUS REGISTER. WHEN AN 
INTERRUPT ARRIVES, IT IS STORED IN THE IF REGISTER. THE 
PRESENT EXAMPLE ACTIVATES THE INTERRUPT SERVICE ROUTINE INTR 
WHEN IT DETECTS THAT INT2- HAS OCCURRED. 




TSTB 
CALLNZ 



4, IF 
INTR 



Check if bit 2 of IF is set, 

and, if so, call subroutine INTR 
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Example 12-13. Copy a Bit From One Location to Another 

* TITL COPY A BIT FROM ONE LOCATION TO ANOTHER 

* 

* BIT I OF Rl NEEDS TO BE COPIED TO BIT J OF R2 . 

* ARO POINTS TO A LOCATION HOLDING I, AND IT IS ASSUMED THAT THE 

* NEXT MEMORY LOCATION HOLDS THE VALUE J. 



Rl 



R2 



*ARO 



*(AR0+1) 



LDI 


1,R0 


LSH 


*ARO , RO ; 


TSTB 


R1,R0 


BZD 


CONT ; 


LDI 


1,R0 


LSH 


*+ARO(l),RO ; 


ANDN 


R0,R2 ; 


OR 


R0,R2 ; 



Shift 1 to align it with bit I 
Test the I-th bit of Rl 
If bit = 0, branch delayed 

Align 1 with J-th location 

If bit = 0, reset J-th bit of R2 

If bit = 1, set J-th bit of R2 



CONT 
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12.3.2 Block Moves 

Since the TMS320C30 directly addresses a large amount of memory, blocks 
of data or program code can be stored off-chip in slow memories and then 
loaded on-chip for faster execution. Data can also be moved from on-chip to 
off-chip for storage or for multiprocessor data trasfers. 

Such data transfers can be accomplished very efficiently in parallel with CPU 
operations, using the DMA. The DMA operation is explained in detail in an 
earlier section of this manual. An alternative to DMA is to perform data 
transfers under program control using load and store instructions in a repeat 
mode. Example 12-14 shows the case where a block of 512 floating-point 
numbers are transferred from external memory to block 1 of the on-chip RAM. 

Example 12-14. Block Move Under Program Control 



TITL BLOCK MOVE UNDER PROGRAM CONTROL 



extern 


.word 


OlOOOH 


blockl 


.word 


0809C00H 




LDI 


@extern,ARO 




LDI 


@blockl,ARl 




LDF 


*ARO++,RO 




RPTS 


510 




LDF 


*ARO++,RO 


II 


STF 


RO,*ARH-+ 




STF 


R0,*AR1 



Source address 
Destination address 

Load the first number 

Repeat following instruction 511 times 
Load the next number, and. . . 
store the previous one 

Store the last number 



12.3.3 Bit-Reversed Addressing 




For an efficient implementation of Fast Fourier Transforms (FFT), the 
TMS320C30 offers the capability of bit-reversed addressing. If the data to 
be transformed is in the correct order, the final result of the FFT is scrambled 
(in bit-reversed order). To recover the frequency-domain data in the correct 
order, certain memory locations have to be swapped. The bit-reversed ad- 
dressing mode offers the alternative of not doing this swapping. The next time 
data needs to be accessed, the access is done in a bit-reversed manner rather 
than sequentially. 

In bit-reversed addressing, IRO holds a value equal to one-half the size of the 
FFT, if real and imaginary data are stored in separate arrays. During accessing, 
the auxiliary register is indexed by IRO, but with reverse carry propagation. 
Example 12-15 illustrates a 512-point complex FFT being moved from the 
place of computation (pointed at by ARO) to a location pointed at by AR1. 
In this example, real and imaginary parts XR(i) and XI (i) of the data are not 
stored in separate arrays, but they are interleaved XR(0),XI(0),XR(1),XI(1 ),..., 
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LDI 
LDI 
LDI 
LDF 
RPTB 


512, IRO 

2,IR1 

511, RC ; 

*+ARO(l),Rl ; 

LOOP 


II 

1l 


LDF 
STF 


*ARO++(IRO)B,R 
R1,*+AR1(1) 


LOOP 
II 


LDF 
STF 


*+ARO(l),Rl ; 
R0,*AR1++(IR1) 
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XR(N-1 ),XI(N-1 ). Because of this arrangement, the length of the array Is 2N 
instead of N, and IRO is set to 512 instead of 256. 

Example 12-15. Bit- Reversed Addressing 

* 

* TITL BIT-REVERSED ADDRESSING 

* 

* THIS EXAMPLE MOVES THE RESULT OF THE 512-POINT FFT 

* COMPUTATION, POINTED AT BY ARC, TO A LOCATION POINTED AT 

* BY ARl. REAL AND IMAGINARY POINTS ARE ALTERNATING. 



Repeat 511+1 times 

Load first imaginary point 



; Load real value (and point 
to next location) and store 
the imaginary value 

Load next imaginary point and store 
; previous real value 



12.3.4 Integer and Floating-point Division 

Although division is not implemented as a single instruction in the 
TMS320C30, the instruction set provides the necessary capabilities for an ef- 
ficient division routine. Integer and floating-point division will be examined 
separately because different algorithms are used. 

72.3.4. 1 Integer Division 

Division is implemented on the TMS320C30 by repeated subtractions using 
SUBC, a special conditional subtract instruction. Consider the case of a 32-bit 
positive dividend with i significant bits (and 32-i sign bits), and a 32-bit po- 
sitive divisor with j significant bits (and 32-j sign bits). The repetition of the 
SUBC command i-j + 1 times produces a 32 bit result where the lower i-j + 1 
bits are the quotient, and the upper 31 -i+j bits, the remainder of the division. 

SUBC implements binary division in the same manner as in long division. The 
divisor (assumed to be smaller than the dividend) is shifted left i-j times to be 
aligned with the dividend. Then, using SUBC, the shifted divisor is subtracted 
from the dividend. For each subtract that does not produce a negative answer, 
the dividend is replaced by the difference. It is then shifted to the left, and a 
one is put in the LSB. If the difference is negative, the dividend is simply 
shifted left by one. This operation is repeated i-j+1 times. 

As an example, consider the division of 33 by 5 using both long division and 
the SUBC method. In this case, i=6, j=3, and the SUBC operation is repeated 
6-3+1 =4 times. 
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LONG DIVISION: 



00000000000000000000000000000101 



000000000000000000000000000001 1 

00000000000000000000000000100001 

-101 



1101 
-101 



11 



Quotient 



Remainder 



SUBC METHOD: 




00000000000000000000000000100001 
000000000000000000000000001 01 000 

Negative difference 

00000000000000000000000001 00001 
000000000000000000000000001 01 000 

0000000000000000000000000001 1 01 

4 

000000000000000000000000001 1 01 01 
000000000000000000000000001 01 000 

00000000000000000000000000001 1 01 

0000000000000000000000000001 1 01 1 
000000000000000000000000001 01 000 

Negative difference 
000000000000000000000000001 1 01 1 



I 

Remainder 



Quot. 



Dividend 
Divisor (aligned) 

(1st SUBC command) 

New Dividend + Quotient 
Divisor 

Difference (>0) (2nd SUBC command) 

New Dividend + Quotient 
Divisor 

Difference (>0) (3rd SUBC command) 



New Dividend + Quotient 
Divisor 

(4th SUBC command) 
Final Result 



When using the SUBC command, both the dividend and the divisor must be 
positive. Example 12-16 shows a realization of the integer division, where the 
sign of the quotient is properly handled. The last instruction before returning 
modifies the condition flag, in case subsequent operations depend on the sign 
of the result. 
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Example 12-16. Integer Division 



TITL INTEGER DIVISION 





SUBROUTINE 


DI\ 


* 






* 






* 


INPUTS: 




* 






* 


OUTPUT: 




* 






* 


REGISTERS 


USED: 


* 






* 


OPERATION: 




* 






* 






* 


CYCLES : 




* 








.globl 


DIVI 


SIGN 


. set 


R2 


TEMPE 


.set 


R3 


TEMP 


.set 


IRO 


COUNl 


.set 


IRl 



SIGNED INTEGER DIVIDEND IN RO , 
SIGNED INTEGER DIVISOR IN Rl. 

RO/Rl into RO. 

R0-R3, IRO, IRl 

1. NORMALIZE DIVISOR WITH DIVIDEND 

2. REPEAT SUBC 

3. QUOTIENT IS IN LSBs OF RESULT 

31-62 (DEPENDS ON AMOUNT OF NORMALIZATION) 



DIVI - SIGNED DIVISION 



Di 

* 


VI: 




* 


DETERMINE 


SIGN OF RESULT 




XOR 


R0,R1,SIGN 




ABSI 


RO 




ABSI 


Rl 




CMP I 


R0,R1 




BHID 


ZERO 



GET ABSOLUTE VALUE OF OPERANDS, 
Get the sign 



Divisor > dividend ? 
If so, return 

NORMALIZE OPERANDS. USE DIFFERENCE IN EXPONENTS AS SHIFT COUNT 
FOR DIVISOR, AND AS REPEAT COUNT FOR 'SUBC. 

Normalize dividend 

PUSH as float 

POP as int 

Get dividend exponent 

Normalize divisor 

PUSH as float 

POP as int 

Get divisor exponent 

Get difference in exponents 

Align divisor with dividend 



FLOAT 


RO,TEMP 


PUSHF 


TEMPF 


POP 


COUNT 


LSH 


-24, COUNT 


FLOAT 


Rl, TEMPF 


PUSHF 


TEMPF 


POP 


TEMP 


LSH 


-24, TEMP 


SUBI 


TEMP, COUNT 


LSH 


COUNT, Rl 




12-25 



Software Applications - Logical and Arithmetic Operations 



DO COUNT+1 SUBTRACT & SHIFTS. 

RPTS COUNT 
SUBC R1,R0 

MASK OFF THE LOWER COUNT+1 BITS OF RO 

SUBRI 31, COUNT ; Shift count is (32 - (COUNT+1)) 

LSH COUNT, RO ; Shift left 

NEGI COUNT 

LSH COUNT, RO ; Shift right to get result 

CHECK SIGN AND NEGATE RESULT IF NECESSARY. 



NEGI R0,R1 

ASH -31, SIGN 

LDINZ R1,R0 

CMPI 0,R0 



Negate result 

Check sign 

If set, use negative result 

Set status from result 



RETS 
* 

* RETURN ZERO. 

* 

ZERO: 




LDI 0,R0 

RETS 

. end 

If the dividend is less than the divisor and fractional division is desired, a di- 
vision can be performed after determining the desired accuracy of the quotient 
in bits. If the desired accuracy is k bits, start by shifting the dividend left by k 
positions. Then apply the algorithm described above, where i should now be 
replaced by i+k. It is assumed that i+k is less than 32. 



12.3.4.2 Computation of Floating-point Inverse and Division 

This section presents a method of implementing floating-point division on the 
TMS320C30. Since the algorithm outlined here computes the inverse of a 
number v, to divide y/v, multiply y by the inverse of v. 

The computation of 1 /v is based on the following iterative algorihm. At the 
i-th iteration, the estimate x[i] of 1/v is computed from v, and the previous 
estimate x[i-1] according to the formula: 

x[i] =x[i-1] • (2.0- v*x[i-1]) 

To start the operation, an initial estimate x[0] is needed. If v=a*2^ a good 
initial estimate is: 

x[0] =1.0* 2-e-1 

Example 12-17 shows the implementation of this algorithm on the 
TMS320C30, where the iteration has been applied 5 times. The choice of the 
number of iterations was based on the desire to have maximum accuracy. The 
accuracy offered by the single-precision floating-point format is 
2'23=1.l92E-7. If more accuracy is desired, more iterations can be used. If 
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less accuracy is acceptable, the execution speed of this implementation can 
be increased by reducing the number of iterations. 

This algorithm properly treats the boundary conditions, when the input num- 
ber is either zero or it has a very large value. When the input is zero, the ex- 
ponent e=-128. Then the calculation of x[0] yields an exponent equal to 
-(-128)-1 =127 and the algorithm will overflow and saturate. On the other 
hand, in the case of a very large number, e=1 27, the exponent of x[0] will be 
-127-1 =-128. This will cause the algorithm to yield zero, which is a reason- 
able handling of that boundary condition. 
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Example 12-17. Inverse of a Floating-Point Number 

* 

* TITL INVERSE OF A FLOATING-POINT NUMBER 
* 

* 

* SUBROUTINE INVF 
* 

* 

* THE FLOATING-POINT NUMBER V IS STORED IN RO . AFTER THE 

* COMPUTATION IS COMPLETED, 1/v IS ALSO STORED IN RO . 
* 

* TYPICAL CALLING SEQUENCE: 

* LDF V, RO 

* CALL INVF 
* 

* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 

* + 

* RO I V = NUMBER TO FIND THE RECIPROCAL OF (UPON THE CALL) 

* RO I 1/v (UPON THE RETURN) 
* 

* REGISTER USED AS INPUT: RO 

* REGISTERS MODIFIED: RO , Rl, R2 , R3 

* REGISTER CONTAINING RESULT: RO 
* 

* CYCLES: 35 WORDS: 32 
* 

* 

.global INVF 
* 

INVF: LDF R0,R3 ; v is saved for later. 

ABSF RO ; The algorithm uses v = |v|. 

* 

* EXTRACT THE EXPONENT OF v. 
* 

PUSHF RO 
POP Rl 
ASH -24, R ; The 8 LSBs of Rl contain the exponent 

* ; of v. 
* 

* x[0] FORMATION GIVEN THE EXPONENT OF v. 

; Now we have -e-1, the exponent of x[0] 
; Now Rl = x[0] = 1.0 * 2**(-e-l). 



NEGI 


Rl 


SUBI 


1,R1 


ASH 


24, Rl 


PUSH 


Rl 


POPF 


Rl 
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NOW THE ITERATIONS BEGIN. 

MPYF R1,R0,R2 ; R2 = v * x[0] 

SUBRF 2.0,R2 ; R2 = 2.0 - v * x[0] 

MPYF R2,R1 ; Rl=x[l]=x[0]*(2.0-v*x[0]) 

MPYF R1,R0,R2 ; R2 = V * X [ 1 ] 

SUBRF 2.0,R2 ; R2=2.0-v*x[l] 

MPYF R2,R1 ; Rl=x[2]=x[l]*(2.0-V*x[l]) 

MPYF R1,R0,R2 ; R2 = v * x[2] 

SUBRF 2.0,R2 ; R2 = 2 . - v * x [2 ] 

MPYF R2,R1 ; Rl=x[3]=x[2]*(2.0-v*x[2]) 

MPYF R1,R0,R2 ; R2 = v * x[3] 

SUBRF 2.0,R2 ; R2=2.0-v*x[3] 

MPYF R2,R1 ; Rl = x[4] = x[3] * (2.0 - v * x[3]) 

RND Rl ; This minimizes error in the LSBs. 

FOR THE LAST ITERATION WE USE THE FORMULATION: 
x[5] = (x[4] * (1.0 - (v * x[4]))) + x[4] 

; R2 = V * x[4] = 1.0. .01.. => 1 

; R2 = 1.0 - V * x[4] = 0.0. .01... => 

; R2 = x[4] * (1.0 - V * x[4]) 

; R2 = x[5] = (x[4]*(1.0-(v*x[4]) ) )+x[4] 

RND R1,R0 ; Round since this is follow by a MPYF. 

NOW THE CASE OF v < IS HANDLED.- 

NEGF R0,R2 

LDF R3,R3 ; This sets condition flags. 

LDFN R2,R0 ; If v < 0, then RO = -RO 

RETS 

END 

. end 



MPYF 


R1,R0,R2 


SUBRF 


1 . , R2 


MPYF 


R1,R2 


ADDF 


R2,R1 



12.3.5 Square Root 



The implementation of the square root on the TMS320C30 is done by an it- 
erative algorithm very similar to the one used for the computation of the in- 
verse. This algorithm computes the inverse of the square root of a number v, 
1/SQRT(v). To derive SQRT(v), multiply this result by v. Since in many ap- 
plications, division by the square root of a number is desirable, the output of 
the algorithm saves the effort to compute the inverse of the square root. 

At the i-th iteration, the estimate x[i] of 1/SQRT(v) is computed from v and 
the previous estimate x[i-1] according to the formula: 

x[i] = x[i-1] * (1.5 - (v/2) * x[i-1] * x[i-1]) 

To start the operation, an initial estimate x[0] is needed. If v=a*2® a good 
initial estimate is: 

x[0] =1.0* 2-e/2 
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Example 12-18 shows the implementation of this algorithm on the 
TMS320C30, where the iteration has been applied 5 times. The choice of the 
number of iterations was based on the desire to have maximum accuracy. If 
more accuracy is desired, more iterations can be used. If less accuracy is ac- 
ceptable, the execution speed of this implementation may be increased by re- 
ducing the number of iterations. 
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Example 12-18. Square Root of a Floating-Point Number 



TITL SQUARE ROOT OF A FLOATING-POINT NUMBER 



SUBROUTINE SQRT 

THE FLOATING POINT NUMBER v IS STORED IN RO . AFTER THE 
COMPUTATION IS COMPLETED, SQRT(v) IS ALSO STORED IN RO . NOTE 
THAT THE ALGORITHM ACTUALLY COMPUTES l/SQRT(v). 

TYPICAL CALLING SEQUENCE: 

LDF V, RO 
CALL SQRT 
ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 

H 

RO I V = NUMBER TO FIND THE SQUARE ROOT OF 

I (UPON THE CALL) 
RO I SQRT(v) (UPON THE RETURN) 

REGISTER USED AS INPUT: RO 
REGISTERS MODIFIED: RO, Rl, R2, R3 
REGISTER CONTAINING RESULT: RO 

CYCLES: 39 WORDS: 33 

.global SQRT 

EXTRACT THE EXPONENT OF V. 



SQRT: 



LDF 

RETSLE 

PUSHF 

POP 

ASH 



R0,R3 

RO 
Rl 
-25, Rl 



Save V 

Return if number non-positive 



The 8 LSBs of Rl contain 1/2 the exponent 
of V. 



X[0] FORMATION GIVEN THE EXPONENT OF V. 





NEGI 


Rl 








ASH 


24, 


Rl 






PUSH 


Rl 








POPF 


Rl 




) 


GENERATE V/2 


. 








MPYF 


0.£ 


,R0 




NOW 


THE ITERATIONS 


BEGIN. 




MPYF 


Rl, 


Rl, 


R2 ; 




MPYF 


RO, 


R2 






SUBRF 


1.5 


,R2 






MPYF 


R2, 


Rl 





Now Rl = x[0] = 1.0 * 2**(-e/2) 



R2 = x[0] * x[0] 

R2 = (v/2) * x[0] * x[0] 

R2 = 1.5 - (v/2) * x[0] * x[0] 

Rl = x[l] = x[0] * 

(1.5 - (v/2)*x[0]*x[0] ) 
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MPYF 


R1,R1,R2 


MPYF 


R0,R2 


SUBRF 


1.5,R2 


MPYF 


R2,R1 


MPYF 


R1,R1,R2 


MPYF 


R0,R2 


SUBRF 


1.5,R2 


MPYF 


R2,R1 


MPYF 


R1,R1,R2 


MPYF 


R0,R2 


SUBRF 


1.5,R2 


MPYF 


R2,R1 


MPYF 


R1,R1,R2 


MPYF 


R0,R2 


SUBRF 


1 . 5 , R2 


MPYF 


R2,R1 


RND 


R1,R0 


MPYF 


R3,R0 


RETS 





R2 = x[l] * x[l] 

R2 = (v/2) * x[l] * x[l] 

R2 = 1.5 - (v/2) * x[l] * x[l] 

Rl = x[2] = x[l] * 

(1.5 - (v/2)*x[l]*x[l]) 
R2 = x[2] * x[2] 
R2 = (v/2) * x[2] * x[2] 
R2 = 1.5 - (v/2) * x[2] * x[2] 
Rl = x[3] = x[2] 

* (1.5 - (v/2)*x[2]*x[2] ) 

R2 = x[3] * x[3] 

R2 = (v/2) * x[3] * x[3] 

R2 = 1.5 - (v/2) * x[3] * x[3] 

Rl = x[4] = x[3] 

* (1.5 - (v/2)*x[3]*x[3] ) 

R2 = x[4] * x[4] 

R2 = (v/2) * x[4] * x[4] 

R2 = 1.5 - (v/2) * x[4] * x[4] 

Rl = x[5] = x[4] 

* (1.5 - (v/2)*x[4]*x[4] ) 



; Round 

; Sqrt(v) from sqrt (v** (-1) ) 



end 



. end 




12.3.6 Extended- Precision Arithmetic 

The TMS320C30 offers 32 bits of precision for integer arithmetic, and 24 bits 
of precision in the mantissa for floating point arithmetic. For higher precision 
in floating-point operations, the eight extended-precision registers RO to R7 
contain eight more bits of accuracy. Since no cortrparable extension is avail- 
able for fixed-point arithmetic, this section discusses how4ixed-point double 
precision can be achieved using the capabilities of the processor. The tech- 
nique consists of performing the arithmetic by parts, similar to the way in 
which longhand arithmetic is done. 

The instruction set has operations ADDC (Add with Carry) and SUBB (Sub- 
tract with Borrow) which use the status carry bit for extended -precision 
arithmetic. The carry bit is affected by the arithmetic operations of the ALU, 
and the rotate and shift instructions. It can also be manipulated directly by 
setting the status register to certain values. For proper operation, the overflow 
mode bit should be reset (OVM = 0) so the accumulator results will not be 
loaded with the saturation values. Example 12-19 and Example 12-20 show 
64-bit addition and 64-bit subtraction. The first operand is stored in the reg- 
isters RO (low word) and Rl (high word). The second operand is stored in 
R2 and R3 respectively. The result is stored in RO and Rl . 
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Example 12-19. 64-Bit Addition 

* TITL 64-BIT ADDITION 

* 

* TWO 64-BIT NUMBERS ARE ADDED TO EACH OTHER PRODUCING A 64-BIT 

* RESULT, THE NUMBERS X (R1,R0) AND Y (R3,R2) ARE ADDED, 

* RESULTING IN W (R1,R0) . 





Rl 


RO 




+ 


R3 


R2 






Rl 


RO 




ADD I 


R2, 


,R0 



ADDC R3,R1 

Example 12-20. 64-Bit Subtraction 

* TITL 64-BIT SUBTRACTION 
* 

* TWO 64-BIT NUMBERS ARE SUBTRACTED FROM EACH OTHER PRODUCING 

* A 64-BIT RESULT. THE NUMBERS X (R1,R0) AND Y (R3,R2) ARE 

* SUBTRACTED, RESULTING IN W (R1,R0). 

* 

* Rl RO 

* - R3 R2 



Rl RO 



SUBI R2,R0 
SUBB R3,R1 

When two 32-bit numbers are multiplied, a 64-bit product results. The pro- 
cedure for multiplication is to split the 32-bit magnitude values of the multi- 
plicand X and the multiplier Y into two parts (X1,X0) and (X3,X2) respectively 
with 16 bits each. The operation is done on unsigned numbers, and the pro- 
duct is adjusted for the sign bit. Example 1 2-21 shows the implementation 
of a 32 bit X 32 bit multiplication. 
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Example 12-21. 32 by 32 Bit Multiplication 



TITL 32 X 32 BIT MULTIPLICATION 



SUBROUTINE EXTMPY 

FUNCTION: TWO 32-BIT NUMBERS ARE MULTIPLIED, PRODUCING A 64-BIT 
RESULT. THE TWO NUMBERS (X and Y) ARE EACH SEPARATED INTO TWO 
PARTS (XI XO) AND (Yl YO), WHERE XO , XI, YO , AND Yl ARE 16 BITS. 
THE TOP BIT IN XI AND Yl IS THE SIGN BIT. THE PRODUCT IS 
IN TWO WORDS (WO AND Wl). THE MULTIPLICATION IS PERFORMED ON 
POSITIVE NUMBERS, AND THE SIGN IS DETERMINED AT THE END. 



X 



XI XO 
Yl YO 



XO*YO 
X0*Y1 
X1*Y0 
X1*Y1 



BITS OF PRODUCTS 

(NOT COUNTING SIGN) PRODUCT 

16+16 PI 

16+16 P2 

16+16 P3 

16+16 P4 



Wl 



WO 



ARGUMENT ASSIGNMENTS 
ARGUMENT | FUNCTION 

+ 

RO I MULTIPLIER AND LOW WORD OF THE PRODUCT 

Rl I MULTIPLICAND AND UPPER WORD OF THE PRODUCT 



REGISTERS USED AS INPUT: RO , Rl 

REGISTERS MODIFIED: RO, Rl, R2 , R3 , R4 , ARO , ARl, 

REGISTER CONTAINING RESULT: R0,R1 



CYCLES: 28 (WORST CASE) 



WORDS: 25 



■GLOBAL EXTMPY 



EXTMPY 



X0R3 
ABSI 
ABSI 



R0,R1,AR0 

RO 

Rl 



Store sign 

Absolute values of X 
and Y 



SEPARATE MULTIPLIER AND MULTIPLICAND INTO TWO PARTS 




LDI 


-16, ARl 


LSH3 


AR1,R0,R2 


AND 


0FFFFH,R0 


LSH3 


AR1,R1,R3 


AND 


0FFFFH,R1 



R2 = XI = Upper 16 bits of X 
RO = XO = Lower 16 bits of X 
R3 = Yl = Upper 16 bits of Y 
Rl = YO = Lower 16 bits of Y 
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CARRY OUT THE MULTIPLICATION 



DONE 



MPYI3 


R0,R1, 


R4 


MPYI 


R3,R0 




MPYI 


R2,R1 




ADD I 


R0,R1 




MPYI 


R2,R3 




LDI 


R1,R2 




LSH 


16, R2 




CMP I 


0,ARO 




BGED 


DONE 




AND 


0FFFFH,R1 


ADDI3 


R4,R2, 


RO 


ADDC3 


R1,R3, 


Rl 


^E THE 


PRODUCT 


IF 


NOT 


RO 




ADDI 


1,R0 




NOT 


Rl 




ADDC 


0,R1 




RETS 






• end 







XO*YO = PI 
X0*Y1 = P2 
X1*Y0 = P3 
P2+P3 
X1*Y1 = P4 



Lower 16 bits of P2+P3 

Check the sign of the product 

If >0, multiplication complete (delayed) 

Upper 16 bits of P2+P3 

WO = RO = Lower word of the product 

Wl = Rl = Upper word of the product 

NUMBERS ARE OF OPPOSITE SIGN 



12.3.7 Floating-point Format Conversion: IEEE to/from TMS320C30 

In fixed-point arithmetic, the binary point that separates the integer from the 
fractional part of the number is fixed at a certain location. For example, if it 
is chosen that a 32-bit number has the binary point after the most significant 
bit (which is also the sign bit), only fractional numbers (numbers with abso- 
lute values less that 1 ), can be represented. In this case, it is said that we have 
a Q31 number, where 31 is the number of fractional bits. All operations as- 
sume that the binary point is fixed at this location. 

The fixed-point system, although simple to implement in hardware, imposes 
limitations in the dynamic range of the represented number, which causes 
scaling problems in many applications. The difficulty is avoided by using 
floating-point numbers. A floating-point number consists of a mantissa m 
multiplied by base b raised to an exponent e: 

m* b^ 

In current hardware implementations, the mantissa is typically a normalized 
number with absolute value between 1 and 2, and the base is b=2. Although 
the mantissa is represented as a fixed-point number, the actual value of the 
overall number floats the binary point because of the multiplication by b^ The 
exponent e is an integer whose value determines the position of the binary 
point in the number. IEEE has established a standard format for the repre- 
sentation of floating-point numbers. 

In order to achieve higher efficiency in the hardware implementation, the 
TMS320C30 uses a floating-point format that differs from the IEEE standard. 
This section describes briefly the two formats and presents software routines 
to convert between them. 
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TMS320C30 floating-point format: 



23 



e s f 



In a 32-bit word representing a floating-point number, the first 8 bits corre- 
spond to the exponent, expressed in two's-complement format. There is one 
bit for sign, and 23 bits for the mantissa. The mantissa is expressed in two's- 
complement form with the binary point after the most significant non-sign bit. 
Since this bit is the complement of the sign bit s, it is suppressed. In other 
words, the mantissa actually has 24 bits. One special case occurs when 
e=-128. In this case, the number is interpreted as zero independent of the 
values of s and f (which are by default set to zero). To summarize, the values 
of the represented numbers in the TMS320C30 floating-point format are: 



2^ • (01 .f) 


if s=0 


2^ • (1 0.f) 


if s=1 





ife=-128 



IEEE floating-point format: 



23 



s e f 



The IEEE floating-point format uses sign-magnitude notation for the mantissa, 
and offset by 127 for the exponent. In a 32-bit word representing a float- 
ing-point number, the first bit is the sign bit. The next 8 bits correspond to 
the exponent, expressed in an offset-by-127 format (the actual exponent is 
e-127). The following 23 bits represent the absolute value of the mantissa 
with the most significant 1 implied. The binary point is after this most signif- 
icant 1 . In other words, the mantissa actually has 24 bits. There are several 
special cases, summarized below. 
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The values of the represented numbers in the IEEE floating-point format are: 

(-l)s* 2e-'>27« (oi.f) if0<e<255 

Special cases: 

(-1)S"0.0 ife=Oandf=0 (zero) 

(-1 )s • 2-"' 26 • (o.f) if e=0 and f<>0 (denormalized) 

(-1 )s • infinity if e=255 and f=0 (infinity) 

NaN if e=255 and f<>0 (Not a Number) 



Based on these definitions of the formats, two versions of the conversion 
routines were developed. One version handles the complete definition of the 
formats. The other ignores some of the special cases (typically the ones that 
are very rarely used), but it has the benefit that it executes faster than the 
complete conversion. For this discussion, they are referred to as the complete 
version and the the fast version. 

12.3.7.1 IEEE to TMS320C30 Floating-Point Format Conversion 

The fast version of the IEEE-to-TMS320C30 conversion routine was originally 
developed by Keith Henry of Apollo Computer, Inc. The other routines were 
based on this initial input. Example 12-22 shows the fast conversion from 
IEEE to TMS320C30 floating-point format. It properly handles the general 
case when 0<e<255, and zeros (i.e., e=0 and f=0). The other special cases 
(denormalized, infinity, and NaN) are not treated and, if present, will give er- 
roneous results. 
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Example 12-22. TMS320C30 To IEEE Conversion (Fast Version) 

* TITL IEEE TO TMS320C30 CONVERSION (FAST VERSION) 



SUBROUTINE 



FMIEEE 



FUNCTION: CONVERSION BETWEEN THE IEEE FORMAT AND THE 

320C30 FLOATING POINT NUMBERS. THE NUMBER TO 
BE CONVERTED IS IN THE LOWER 32 BITS OF RO . 
THE RESULT IS STORED IN THE UPPER 32 BITS OF RO , 
UPON ENTERING THE ROUTINE, ARl POINTS TO THE 
FOLLOWING TABLE: 

(0) OxFFSOOGGO < — ARl 

(1) OxFFOOOOOO 

(2) GxVFOOOOGG 

(3) GxSGOOOGGO 

(4) GxSlGOGGOG 

ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 

+ 

RG I NUMBER TO BE CONVERTED 

ARl I POINTER TO TABLE WITH CONSTANTS 

REGISTERS USED AS INPUT: RG, ARl 
REGISTERS MODIFIED: RG , Rl 
REGISTER CONTAINING RESULT: RG 

NOTE: SINCE THE STACK POINTER SP IS USED, MAKE SURE TO 
INITIALIZE IT IN THE CALLING PROGRAM. 



* CYCLES: 12 (WORST CASE) WORDS: 12 



.global FMIEEE 



FMIEEE 



NEG 




AND 3 


RG,*AR1,R1 


BND 


NEG 


ADD I 


R0,R1 


LDIZ 


*+ARl(l) ,R1 


SUB I 


*+ARl(2) ,R1 


PUSH 


Rl 


POPF 


RG 


RETS 




PUSH 


Rl 


POPF 


RG 


NEGF 


RO,RO 


RETS 





Replace fraction with G 

Test sign 

Shift sign and exponent inserting G 

If all zero, generate C3G zero 

Unbias exponent 

Load this as a fit. pt. number 



Load this as a fit. pt. number 
Negate if original sign negative 



Example 12-23 is the complete conversion between IEEE and TMS320C30 
formats. In addition to the general case and the zeros, it handles the special 
cases as follows: 



If NaN (e=255, f<>0), the number is returned intact. 

If infinity (e=255, f=0); the output is saturated to the most 

negative number respectively. 



positive or 
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If denormalized (e=0, f<>0), two cases are considered. If the MSB of 
f is 1 , tlie number is converted to TMS320C30 format. Otiierwise, an 
underflow occurs and the number is set to zero. 
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Example 12-23. IEEE to TMS320C30 conversion (complete version) 

* 

* TITL IEEE TO TMS320C30 CONVERSION (COMPLETE VERSION) 



SUBROUTINE 



FMIEEEl 



FUNCTION: CONVERSION BETWEEN THE IEEE FORMAT AND THE 320C30 

FLOATING POINT NUMBERS. THE NUMBER TO BE CONVERTED 
IS IN THE LOWER 32 BITS OF RO . THE RESULT IS STORED 
IN THE UPPER 32 BITS OF RO . 

UPON ENTERING THE ROUTINE, ARl POINTS TO THE FOLLOWING TABLE: 

(0) 0XFF800000 < — ARl 

(1) OxFFOOOOOO 

(2) 0X7F000000 

(3) 0x80000000 

(4) 0x81000000 

(5) 0X7F800000 

(6) 0x00400000 

(7) 0X007FFFFF 

(8) Ox7F7FFFFF 

ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 

+ 

RO I NUMBER TO BE CONVERTED 

ARl I POINTER TO TABLE WITH CONSTANTS 

REGISTERS USED AS INPUT: RO , ARl 
REGISTERS MODIFIED: RO, Rl 
REGISTER CONTAINING RESULT: RO 

NOTE: SINCE THE STACK POINTER SP IS USED, MAKE SURE TO INITIALIZE 
IT IN THE CALLING PROGRAM. 



CYCLES: 23 (WORST CASE) 
.global FMIEEEl 



WORDS: 34 




FM 


lEEEl LDI 
AND 
BZ 


R0,R1 

*+ARl(5) ,R1 
UNNORM 




XOR 
BNZ 


*-t-ARl(5) ,R1 
NORMAL 


* 


HANDLE NaN 


AND INFINITY 




TSTB *+ARl(7),R0 

RETSNZ 

LDI RO,RO 

LDFGT *+ARl(8),R0 

LDFN 

*-l-ARl(5) ,R0 

RETS 



If e=0, number is either or 
unnormalized 

If e<255, use regular routine 



Return if NaN 

If positive, infinity= 
most positive number 

If negative, infinity= 

most negative number RETS 
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HANDLE ZEROS AND UNNORMALIZED NUMBERS 



UNNORM 


TSTB 


*+ARl(6) ,R0 




LDFZ 


*+ARl(3) ,R0 




RETSZ 






XOR 


*+ARl(6) ,R0 




END 


NEGl 




LSH 


1,R0 




SUBI 


*+ARl(2) ,R0 




PUSH 


RO 




POPF 


RO 




RETS 




NEGl 


POPF 


RO 




NEGF 


RO,RO 




RETS 




* HANDLE THE 

•k 


REGULAR CASES 


NORMAL 


AND 3 


R0,*AR1,R1 




END 


NEG 




ADD I 


R0,R1 




SUBI 


*+ARl(2) ,R1 




PUSH 


Rl 




POPF 


RO 




RETS 




NEG 


POPF 


RO 




NEGF 


RO,RO 




RETS 





Is the iiisb of f equal to 1? 
If not, force the number to zero 
and return 

If (msb of f)=l, make it 

Eliminate sign bit and line up mantissa 
Make e=-127 

Put number in floating point format 
If negative, negate RO 



Replace fraction with 

Test sign 

Shift sign and exponent inserting 

Unbias exponent 

Load this as a fit. pt. number 



Load this as a fit. pt. number 
Negate if original sign negative 



12.3.7.2 TMS320C30 to IEEE Floating-Point Format conversion 

The vast majority of the numbers represented by the TI\/IS320C30 format are 
covered by the general IEEE format and the representation of zeros. The only 
special case to consider is when e=-127 in the TMS320C30 format. This 
corresponds to an denormalized number in IEEE format, and it is ignored in the 
fast version, while it is treated properly in the complete version. Example 
12-24 shows the fast, and Example 12-25, the complete version of the 
TMS320C30-to-IEEE conversion. 
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Example 12-24. TMS320C30 to IEEE Conversion (Fast Version) 

* 

* TITL TMS320C30 TO IEEE CONVERSION (FAST VERSION) 



SUBROUTINE TOIEEE 

FUNCTION: CONVERSION BETWEEN THE 320C30 FORMAT AND THE IEEE 

FLOATING POINT NUMBERS. THE NUMBER TO BE CONVERTED 
IS IN THE UPPER 32 BITS OF RO . THE RESULT WILL BE IN 
THE LOWER 32 BITS OF RO . 

UPON ENTERING THE ROUTINE, ARl POINTS TO THE FOLLOWING TABLE: 

(0) OxFFSOOOOO < — ARl 

(1) OxFFOOOOOO 

(2) GxVFOOOOOO 

(3) 0x80000000 

(4) 0x81000000 

ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 

+ 

RO I NUMBER TO BE CONVERTED 

ARl I POINTER TO TABLE WITH CONSTANTS 

REGISTERS USED AS INPUT: RO, ARl 
REGISTERS MODIFIED: RO 
REGISTER CONTAINING RESULT: RO 



NOTE; 



SINCE THE STACK POINTER ' SP ' IS USED, MAKE SURE TO 
INITIALIZE IT IN THE CALLING PROGRAM. 



CYCLES: 14 (WORST CASE) WORDS: 15 
.global TOIEEE 



TOIEEE 



LDF 

LDFZ 

BND 

ABSF 

LSH 

PUSHF 

POP 

ADD I 

LSH 

RETS 



RO,RO 

*+ARl(4) ,R0 

NEG 

RO 

1,R0 

RO 

RO 

*+ARl(2) ,R0 

-1,R0 



Determine the sign of the number 
If zero, load appropriate number 
Branch to NEG if negative (delayed) 
Take the absolute value of the number 
Eliminate the sign bit in RO 

Place number in lower 32 bits of RO 
Add exponent bias (127) 
Add the positive sign 




NEG 



POP 
ADD I 
LSH 
ADD I 
RETS 



RO 

*+ARl(2) ,R0 
-1,R0 
*+ARl(3) ,R0 



Place number in lower 32 bits of RO 
Add exponent bias (127) 
Make space for the sign 
Add the negative sign 
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Example 12-25. TMS320C30 to IEEE Conversion (Complete Version) 

* 

* TITL TMS320C30 TO IEEE CONVERSION (COMPLETE VERSION) 



SUBROUTINE 



TOIEEEl 



FUNCTION: CONVERSION BETWEEN THE 320C30 FORMAT AND THE IEEE 

FLOATING POINT NUMBERS. THE NUMBER TO BE CONVERTED 
IS IN THE UPPER 32 BITS OF RO . THE RESULT WILL BE 
IN THE LOWER 32 BITS OF RO . 



UPON ENTERING THE ROUTINE, ARl POINTS TO THE FOLLOWING TABLE; 
< — ARl 



(0) 


OxFFSOOOOO 


(1) 


OxFFOOOOOO 


(2) 


OxVFGOOOOO 


(3) 


0x80000000 


(4) 


0x81000000 


(5) 


0x7F800000 


(6) 


0x00400000 


(7) 


OxOOTFFFFF 


(8) 


Ox7F7FFFFF 



ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 

H 

RO i NUMBER TO BE CONVERTED 

ARl I POINTER TO TABLE WITH CONSTANTS 

REGISTERS USED AS INPUT: RO , ARl 
REGISTERS MODIFIED: RO 
REGISTER CONTAINING RESULT: RO 

NOTE: SINCE THE STACK POINTER ' SP ' IS USED, MAKE SURE TO 
INITIALIZE IT IN THE CALLING PROGRAM. 



* 


CYCLES : 


31 (WORST C 


* 


.global 


TOIEEEl 


TOIEEEl 


LDF 


RO,RO 




LDFZ 


*+ARl(4) ,R0 




BND 


NEG 




ABSF 


RO 




LSH 


1,R0 




PUSHF 


RO 




POP 


RO 




ADD I 


*+ARl(2) ,R0 




LSH 


-1,R0 



WORDS: 25 



Determine the sign of the number 
If zero, load appropriate number 
Branch to NEG if negative (delayed) 
Take the absolute value of the number 
Eliminate the sign bit in RO 

Place number in lower 32 bits of RO 
Add exponent bias (127) 
Add the positive sign 
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CONT 



NEG 



TSTB 


*+ARl(5) 


,R0 


RETSNZ 






TSTB 


*+ARl(7) 


,R0 


RETSZ 






PUSH 


RO 




POPF 


RO 




LSH 


-1,R0 




PUSHF 


RO 




POP 


RO 




ADD I 


*+ARl(6) 


,R0 


RETS 






POP 


RO 




BRD 


CONT 




ADD I 


*+ARI(2) 


,R0 


LSH 


-1,R0 




ADD I 


*+ARl(3) 


,R0 



If E>0, return 

If E=0 & F=0, return 

Move F right by one bit 

Add to F a msb of 1 

Place number in lower 32 bits of RO 

Add exponent bias (127) 
Make space for the sign 
Add the negative sign 
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12.4 Application-Oriented Operations 

The TMS320C30 has been designed to provide efficient implementations of 
digital signal processing algorithms. The architecture and the instruction set 
of the device include features that facilitate the solution of numerically inten- 
sive problems. This section presents examples of applications using these 
features, such as companding, filtering. Fast Fourier Transforms (FFT), and 
matrix arithmetic. 

12.4.1 Companding 

In the area of telecommunications, one of the primary concerns is the conser- 
vation of the channel bandwidth, while at the same time preserving high 
speech quality. This is achieved by quantizing the speech samples logarith- 
mically. It has been demonstrated that an 8-bit logarithmic quantizer produces 
speech quality equivalent to a 13-bit uniform quanitizer. The logarithmic 
quantization is achieved by companding (COMpress/exPANDing). Two in- 
ternational standards have been established for companding: the p-law (used 
in the United States and Japan), and the A-law (used in Europe). Detailed 
descriptions of p-law and A-law companding are presented in an application 
report on companding routines included in the book "Digital Signal Process- 
ing Applications with the TMS320 Family". 

During transmission, logarithmically compressed data in sign-magnitude form 
are transmitted along the communications channel. If any processing is nec- 
essary, these data should be expanded to a 14-bit (for p-'aw) or 13-bit (for 
A-law) linear format. This operation is done upon receiving the data at the 
digital signal processor. After processing, and in order to continue trans- 
mission, the result is compressed back to 8-bit format and transmitted through 
the channel. 

Examples 12-26 and 12-27 show p-law compression and expansion (i.e., li- 
near to M-law and jj-law to linear conversion), while examples 12-28 and 
12-29 show A-law compression and expansion. For expansion, using a 
look-up table offers an alternative approach. It trades memory space for speed 
of execution. Since the compressed data is 8-bits long, a table with 256 en- 
tries can be constructed containing the expanded data. If the compressed data 
is stored in the register ARO, the following two instructions will put the ex- 
panded data in register RO: 

ADDI §TABL,ARO ; @TABL = BASE ADDRESS OF TABLE 
LDI *ARO,RO ; PUT EXPANDED NUMBER IN RO 

The same look-up table approach could be used for compression, but the re- 
quired table length would then be 1 6,384 words for M-law or 8,1 92 words for 
A-law. If this memory size is not acceptable, the subroutines presented in 
Examples 12-26 or 12-28. should be used. 
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Example 12-26. U-Law compression 



TITL U-LAW COMPRESSION 



SUBROUTINE MUCMPR 



ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 

+ 

RO I NUMBER TO BE CONVERTED 

REGISTERS USED AS INPUT: RO 
REGISTERS MODIFIED: RO, Rl, R2 , SP 
REGISTER CONTAINING RESULT: RO 

NOTE: SINCE THE STACK POINTER ' SP ' IS USED IN THE COMPRESSION 
ROUTINE 'MUCMPR', MAKE SURE TO INITIALIZE IT IN THE 
THE CALLING PROGRAM. 



CYCLES: 20 WORDS: 17 



.global MUCMPR 




MUCMPR LDI 


R0,R1 


ABSI 


RO,RO 


CMP I 


1FDEH,R0 ; 


LDIGT 


1FDEH,R0 ; 


ADD I 


33, RO 


FLOAT 


RO ; 


MPYF 


0. 03125, RO ; 


LSH 


1,R0 ; 


PUSHF 


RO 


POP 


RO ; 


LSH 


-20, RO ; 


LDI 


0,R2 


LDI 


R1,R1 ; 


LDILT 


80H,R2 ; 


ADD I 


R2,R0 ; 


NOT 


RO ; 


RETS 





Save sign of number 

If RO>OxlFDE, 
saturate the result 
Add bias 

Normalize : ( seg+5 ) OWXYZx . . . x 
Adjust segment number by 2** (-5) 
(seg)WXYZx. . .X 

Treat number as integer 
Right- justify 



If number is negative, 

set sign bit 
RO = compressed number 
Reverse all bits for transmission 
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Example 12-27. U-Law Expansion 



* TITL 'U-LAW EXPANSION' 



SUBROUTINE 



MUXPND 



ARGUMENT ASSIGNMENTS; 
ARGUMENT | FUNCTION 



RO 



NUMBER TO BE CONVERTED 



REGISTERS USED AS INPUT: RO 
REGISTERS MODIFIED: RO, Rl, R2, SP 
REGISTER CONTAINING RESULT: RO 



* CYCLES: 20 (WORST CASE) 



WORDS : 14 



.global MUXPND 



MUXPND NOT 


RO,RO 


LDI 


RO,RO 


AND 


0FH,R1 


LSH 


1,R1 


ADDI 


33, Rl 


LDI 


R0,R2 


LSH 


-4,R0 


AND 


7,R0 


LSH3 


R0,R1,R0 


SUBI 


33, RO 


TSTB 


80H,R2 


RETSZ 




NEGI 


RO 


RETS 





Complement bits 

Isolate quantization bin 

Add bias to introduce Ixxxxl 
Store for sign bit 

Isolate segment code 
Shift and put result in RO 
Subtract bias 
Test sign bit 

Negate if a negative number 
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Example 12-28. A- Law Compression 



TITL A-LAW COMPRESSION 



SUBROUTINE 



ACMPR 



ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 

+ 

RO I NUMBER TO BE CONVERTED 

REGISTERS USED AS INPUT: RO 
REGISTERS MODIFIED: RO , Rl, R2 , SP 
REGISTER CONTAINING RESULT: RO 

NOTE: SINCE THE STACK POINTER ' SP ' IS USED IN THE COMPRESSION 
ROUTINE 'ACMPR', MAKE SURE TO INITIALIZE IT IN THE 
CALLING PROGRAM. 



CYCLES: 22 



WORDS: 19 



globl ACMPR 



ACMPR 



END 




LDI 

ABSI 

CMP I 

BLED 

CMP I 

LDIGT 

LSH 

FLOAT 

MPYF 

LSH 

PUSHF 

POP 

LSH 

LDI 

LDI 

LDILT 

ADDI 

XOR 

RETS 



R0,R1 

RO,RO 

1FH,R0 

END 

OFFFH,RO 

OFFFH,RO 

-1,R0 

RO 

0.125,R0 

1,R0 

RO 

RO 

-20, RO 

0,R2 

R1,R1 

80H,R2 

R2,R0 

0D5H,R0 



Save sign of number 

If R0<0x20, 

Do linear coding 
If R0>0xFFF, 

saturate the result 
Eliminate rightmost bit 

Normalize: (seg+3) OWXYZx. . .x 
Adjust segment number by 2** (-3) 
(seg)WXYZx. . .x 

Treat number as integer 
Right- justify 



If number is negative, 

set sign bit 
RO = compressed number 
Invert even bits for transmission 
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Example 12-29. A-Law Expansion 



TITL A-LAW EXPANSION 



SUBROUTINE 



AXPND 



ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 

+ 

RO I NUMBER TO BE CONVERTED 

REGISTERS USED AS INPUT: RO 
REGISTERS MODIFIED: RO, .Rl, R2 , SP 
REGISTER CONTAINING RESULT: RO 



CYCLES: 25 (WORST CASE) 



WORDS : 16 



.global AXPND 



AXPND 



SKIPl 



XOR 


D5H,R0 


LDI 


R0,R1 


AND 


0FH,R1 


LSH 


1,R1 


LDI 


R0,R2 


LSH 


-4,R0 


AND 


7,R0 


BZ 


SKIPl 


SUBI 


1,R0 


ADD I 


32, Rl 


ADDI 


1,R1 


LSH3 


R0,R1,R0 


TSTB 


80H,R2 


RETSZ 




NEGI 


RO 


RETS 





Invert even bits 
Isolate quantization bin 
Store for bit sign 
Isolate segment code 



Create Ixxxxl 

OR Oxxxxl 
Shift and put result in RO 
Test sign bit 

Negate if a negative number 



12.4.2 FIR, ilR, and Adaptive Filters 

Digital filters are a common requirement for digital signal processing systems. 
There are two types of digital filters. Finite Impulse Response (FIR) and Infi- 
nite Impulse Response (IlR). Each of these types can have either fixed or 
adaptable coefficients. In this section, the fixed-coefficient filters are pre- 
sented first, and then the adaptive filters are discussed. 
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12.4.2. 1 FIR Filters 

If the FIR filter has an impulse response /?[0], /?[1 ],..., /7[N-1], and x[/7] re- 
presents the input of the filter at time n, the output /[/?] at time n is given by 
the equation: 

y[n] =/7[0] X[/7] + /7[1] X [/7-1] + ...+ /7[N-1] x[/7-(N-1)] 

Two features of the TMS320C30 that facilitate the implementation of the FIR 
filters, are parallel multiply/add operations and the circular addressing. The 
first one permits the performance of a multiplication and an addition in a single 
machine cycle, while the second one makes a finite buffer of length N suffi- 
cient for the data x. 

Figure 12-1 shows the arrangement of the memory locations in order to im- 
plement the circular addressing, while Example 12-30 presents the 
TMS320C30 assembly code for an FIR filter. 



impulse 

lo„ response 

address | h(N-l) 



high 
address 



h(N-2) 



h(1) 



h(0) 



oldest input 



newest input 



initial 
input samples 



xin 


-IN 


-1)] 


x[n 


-(N 


-2)1 


• 
• 


X 


(n- 


1) 


x(n) i 



final 
input samples 



x(n) 



x[n-(N-l)J 



x(n-2) 



x(n-l) 



circular 
queue 



Figure 12-1. Data Memory Organization For a FIR Filter 



In order to set up the circular addressing, the block-size register BK should 
be initialized to block length N. Also, the locations for signal x should start 
from a memory location whose address is a multiple of the smallest power of 
2 that is greater than or equal to N. For instance, if N=24, the first address for 
X should be a multiple of 32 (the lower 5 bits of the beginning address should 
be zero). To understand this requirement, look at the section describing cir- 
cular addressing. 
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In Example 12-30, the pointer to the Input sequence x, is incremented and 
assumed to be moving from an older input to a newer input. At the end of the 
subroutine, AR1 will be pointing to the position for the next input sample. 

Example 12-30. FIR Filter 

* 

* TITL FIR FILTER 

* 
* 

* SUBROUTINE FIR 
* 

* EQUATION: y(n) = h(0) * x(n) + h(l) * x(n-l) + 

* . . . + h(N-l) * x(n-(N-l) ) 

* 

* TYPICAL CALLING SEQUENCE: 



LOAD 


ARC 


LOAD 


ARl 


LOAD 


RC 


LOAD 


BK 


CALL 


FIR 



ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 



* ARO I ADDRESS OF h(N-l) 

* ARl I ADDRESS OF x(N-l) 

* RC I LENGTH OF FILTER - 2 (N-2) 

* BK I LENGTH OF FILTER (N) 
* 

* REGISTERS USED AS INPUT: ARO, ARl, RC, BK 

* REGISTERS MODIFIED: RO, R2 , ARO, ARl, RC 

* REGISTER CONTAINING RESULT: RO 
* 

* 

* CYCLES: 11 + (N-l) WORDS: 6 
* 

* 

.global FIR 

* ; Initialize RO : 
FIR MPYF3 * ARO++ ( 1 ) , * ARH-+ ( 1 ) % , RO 

* ; h(N-l) * x(n-(N-l)) -> RO 
LDF 0.0, R2 ; Initialize R2 . 

* 

* FILTER { 1 <= i < N) 
* 

RPTS RC ; Setup the repeat cycleE. 

MPYF3 *AR0++(1) ,*ARH-+(1)%,R0 ; h(N-l-i) *x(n- (N-l-i) ) ->R0 

I I ADDF3 R0,R2,R2 ; Multiply and add operation 

* 

ADDF R0,R2,R0 ; Add last product 
* 

* RETURN SEQUENCE 

RETS ; Return 

* 

* end 
* 

.end 
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12.4.2.2 IIR Filters 

The transfer function of the IIR filters has both poles and zeros. Its output 
depends on both the input and the past output. As a rule, they need less 
computation than an FIR with similar frequency response, but they have the 
drawback of being sensitive to coefficient quantization. Most often, the IIR 
filters are implemented as a cascade of second-order sections, called biquads. 
Examples 12-31 and 12-32 show the implementation for one biquad and for 
any number of biquads respectively. 

The equation for a single biquad is given by: 

y[n] = a^ y{nA ] + al y[n-2] + bO x[n] + b^ x\_n-^ ] + bl x[n-2] 

This equation can be implemented more conveniently by the following two 
equations which have less storage requirements: 

d[n] = al cr[A7-2]+ al £/[a7-1]+ x[/7] 
y\n\ = bl d[/7-2]+ Z?1 £y[/j-1]+ ibO dln\ 

Figure 12-2 shows the memory organization for this approach, and Example 
12-31 is an implementation of a single biquad on the TMS320C30. 





filter 
coefficients 






initial delay 
node values 




final delay 
node values 




address 


a2 


newest deiay 
oldest delay 


din) 




d(n-l) 


1 




b2 


d(n-1) 


d(n-2) 


circular queue 




a1 


d(n-2) 


din) 


1 




bl 










high 


bO 




address 

Fig 


ure12-2. 


D 


ata Memory 


Organization For a Single Biquad 




As in the case of FIR filters, the address for the start of the values d must be 
a multiple of 4, i.e., the last two bits of the beginning address must be zero. 
The block-size register BK must be initialized to 3. 
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Example 12-31. MR Filter (One Biquad) 

* 

* TITL IIR filter 
* 

* 

* SUBROUTINE I I R 1 
* 

* IIRl == IIR FILTER (ONE BIQUAD) 
* 

* 

* EQUATIONS: d(n) = a2 * d(n-2) + al * d(n-l) + x(n) 

* y(n) = b2 * d(n-2) + bl * d(n-l) + bO * d(n) 



OR 



y(n) = al*y(n-l) + a2*y(n-2) + bO*x(n) 
+ bl*x{n-l) + b2*x(n-2) 



TYPICAL CALLING SEQUENCE: 



load 


R2 


load 


ARO 


load 


ARl 


load 


BK 


CALL 


IIRl 



ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 

+ 

R2 I INPUT SAMPLE X(N) 

ARO I ADDRESS OF FILTER COEFFICIENTS (A2) 
ARl I ADDRESS OF DELAY MODE VALUES (D(N-2)) 
BK I BK = 3 

REGISTERS USED AS INPUT: R2 , ARO, ARl, BK 
REGISTERS MODIFIED: RO, Rl, R2 , ARO, ARl 
REGISTER CONTAINING RESULT: RO 



CYCLES : 



11 WORDS; 



8 



* FILTER 

.global IIRl 
* 

IIRl MPYF3 *AR0,*AR1,R0 

* ; a2 * d(n-2) -> RO 
MPYF3 *++ARO(l) ,*AR1 — {1)%,R1 

* ; b2 * d(n-2) -> Rl 
* 

MPYF3 *++ARO(l) ,*AR1,R0 ; al * d(n-l) -> RO 

I I ADDF3 R0,R2,R2 ; a2*d (n-2 ) +x (n) -> R2 

* 

MPYF3 *++ARO(l) ,*AR1— (1)%,R0 ; bl * d(n-l) -> RO 

II ADDF3 R0,R2,R2 ; al*d(n-l) +a2*d(n-2 ) +x(n) -> R2 
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MPYF3 *++ARO(l) ,R2,R2 ; bO * d(n) -> R2 
STF R2,*AR1++(1)% 

; Store d(n) and point to d(n-l) . 

ADDF R0,R2 ; bl*d(n-l) +bO*d(n) -> R2 

ADDF R1,R2,R0 ; b2*d(n-2) +bl*d(n-l) +bO*d(n) -> RO 

RETURN SEQUENCE 

RETS ; Return 

end 

. end 

In the more general case, the MR filter will contain N>1 biquads. The 
equations for its implementation are given by the following pseudo-C lan- 
guage code: 

y[0,n] = x[n] 

for (/=0;/<N;/++){ 

dU,n] = a2[i] d[i, n-2] + a^ [/] d[ /.n-1] + /[ /-I,/?] 
/[/;/?] = b2{i\ d[hn-2]+ b^ [/] d[hn-^]+ bO[f] d[hn] 

} 

y[n] =/[N-1,n] 

The corresponding memory organization is shown in Figure 12-3, while Ex- 
ample 1 2-32 is the TMS320C30 assembly-language code. 
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low 
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Figure 12-3. Data Memory Organization For N Biquads 



The block register BK should be initialized to 3, and the beginning of each set 
of d values (i.e., dli.n], /=0...N-1) should be at an address that is a multiple 
of 4 (the last two bits zero), as stated in the case of a single biquad. 
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Example 12-32. IIR Filters (N > 1 Blquads) 

* 

* TITL IIR FILTERS (N > 1 BIQUADS) 
* 

* 

* SUBROUTINE IIR2 
* 

* 
* 

* EQUATIONS: y(0,n) = x(n) 
* 

* FOR (i = 0; i < N; i++) 

* { 

* d(i,n) = a2(i) * d(i,n-2) + al(i) * d(i,n-l) + y(i-l,n) 

* y(i,n) = b2(i) * d(i,n-2) + bl(i) * d(i,n-l) + bO(i) * d(i,n) 

* } 

* y(n) = y(N-l,n) 
* 

* TYPICAL CALLING SEQUENCE: 



load 


R2 


load 


ARC 


load 


ARl 


load 


IRO 


load 


IRl 


load 


BK 


load 


RC 


CALL 


IIR2 



ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 

+ 



R2 

ARO 

ARl 

BK 

IRO 

IRl 

RC 



INPUT SAMPLE x(n) 

ADDRESS OF FILTER COEFFICIENTS (a2(0)) 

ADDRESS OF DELAY NODE VALUES (d(0,n-2)) 

BK = 3 

IRO = 4 

IRl = 4*N-4 

NUMBER OF BIQUADS (N) - 2 



REGISTERS USED AS INPUT: R2 , ARO, ARl, IRO, IRl, BK, RC 
REGISTERS MODIFIED: RO , Rl, R2 , ARO, ARl, RC 
REGISTER CONTAINING RESULT: RO 

CYCLES: 23 + 6(N-1) WORDS: 17 



.global IIR2 
* 

IIR2 MPYF3 *ARO, *AR1, RO 

* ; a2(0) * d(0,n-2) -> RO 
MPYF3 *++ARO(l), *AR1 — (1)%, Rl 

* ; b2(0) * d(0,n-2) -> Rl 
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II 



MPYF3 *++ARO(l) ,*AR1,R0 ; al(0) * D(0,n-1) -> RO 
ADDF3 RO, R2, R2 ; First sum term of d(0,n). 



MPYF3 
ADDF 3 
MPYF3 
STF 



RPTB 



II 

■k 


MPYF3 
ADDF 3 


II 


MPYF3 
ADDF 3 


II 


MPYF3 
ADDF 3 


II 


MPYF3 
ADDF 3 




STF 


LOOP 


MPYF3 



*++ARO(l) ,*AR1--{1)%,R0 ;bl(0) * d(0,n-l) -> RO 
RO, R2, R2 ; Second sum term of d(0,n) . 
*++ARO(l) ,R2,R2 ;bO{0) * d(0,n) -> R2 
R2, *AR1 — (1)% 

; Store d(0,n); Point to d(0,n-2) 



LOOP 



; Loop for 1 <= i < n 



*++AR0(l) ,*++ARl(IRO) ,R0 ;a2(i) * d(i,n-2) -> RO 
R0,R2,R2 ; First sum term of y(i-l,n). 

*++AR0(l) ,*AR1— {1)%,R1 ;b2(i) * D(i,n-2) -> Rl 
R1,R2,R2 ; Second sum term of y(i-l,n). 

*++AR0(l) ,*AR1,R0 ;al(i) * d(i,n-l) -> RO 
RO, R2, R2 ; First sum of d(i,n). 

*++ARO(l) ,*AR1~(1)%,R0 ;bl(i) * d{i,n-l) -> RO 
RO , R2 , R2 ; Second sum term of d(i,n). 

R2, *AR1 — (1)% 

; Store d(i,n); point to d(i,n-2) 
*++ARO(l) , R2, R2 

; bO(i) * d(i,n) -> R2 



FINAL SUMMATION 



ADDF 
ADDF 3 


R0,R2 
R1,R2,R0 


NOP 
NOP 


*AR1— (IRl) 
*AR1— (1)% 


RETURN SEQUENCE 


RETS 




end 




. end 





First sum term of y(n-l,n) 
Second sum term of y(n-l,n) 

Return to first biquad 
Point to d(0,n-l) 



Return 



72.4.2.3 Adaptive Filters (LI\/IS Algorithm) 

There are applications in digital signal processing where a filter must be 
adapted over time to keep track of changing conditions. The book "Theory 
and Design of Adaptive Filters" by Treichler, Johnson, and Larimore (Wiley- 
Interscience, 1 987) presents the theory of adaptive filters. Although in theory 
both FIR and MR structures can be used as adaptive filters, the stability prob- 
lems and the local optimum points that the MR filters exhibit, make them less 
attractive for such an application. Hence, until further research makes II R fil- 
ters a better choice, only the FIR filter are used in adaptive algorithms of 
practical applications. 

In an adaptive FIR filter, the filtering equation takes the form: 

y[n] = h[n,0] x[n]+ /?[/7,1] x[n-^]+...+ /^^N-l] x[/7-(N-1)] 
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The filter coefficients are time-dependent. In a least-mean -squares (LMS) 
algorithm, the coefficients are updated by a formula of the form: 

h[n+^.f] = h[n,i] + p x[n-i], /=0,1 N-1 

P is a constant for the computation. The updating of the filter coefficients can 
be interleaved with the computation of the filter output so that it takes 3 cycles 
per filter tap to do both. The updated coefficients are written over the old 
filter coefficients. Example 1 2-33 shows the implementation of an adaptive 
FIR filter on the TMS320C30. The memory organization and the positioning 
of the data in memory should follow the same rules as the above FIR filter with 
fixed coefficients. 
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Example 12-33. Adaptive FIR Filter (LMS Algorithm) 

* TITL ADAPTIVE FIR FILTER (LMS ALGORITHM) 

* 

* SUBROUTINE LMS 

* LMS == LMS ADAPTIVE FILTER 



* EQUATIONS: y(n) = h(n,0)*x(n) + h(n,l)*x(n-l) + ... 

* + h(n,N-l)*x(n-{N-l) ) 

* FOR (i = 0; i < N; i++) 

* h(n+l,i)=h(n,i)+ tmuerr * x(n-i) 
* 

* TYPICAL CALLING SEQUENCE: 



load 


R4 


load 


ARO 


load 


ARl 


load 


RC 


load 


BK 


CALL 


FIR 



ARGUMENT ASSIGNMENTS: 
ARGUMENT I FUNCTION 



* 


R4 1 


* 


ARO 1 


* 


ARl 1 


& 


RC 1 


& 


BK 1 



SCALE FACTOR (2 * mu * err) 
ADDRESS OF h(n,N-l) 
ADDRESS OF x(n-(N-l)) 
LENGTH OF FILTER - 2 {N-2) 
LENGTH OF FILTER (N) 

REGISTERS USED AS INPUT: R4 , ARO, ARl, RC, BK 
REGISTERS MODIFIED: RO , Rl, R2 , ARO, ARl, RC 
REGISTER CONTAINING RESULT: RO 

PROGRAM SIZE: 10 words 

EXECUTION CYCLES: 12 + 3(N-1) 



* SETUP (i = 0) 
* 

.global LMS 
* 

LMS MPYF3 *ARO, *AR1, RO 



Initialize RO : 



h(n,N-l) * x(n-(N-l)) -> RO 
LDF 0.0, R2 ; Initialize R2 . 

; Initialize Rl: 
MPYF3 *AR1++(1)%, R4, Rl 

; x(n-{N-l)) * tmuerr -> Rl 
ADDF3 *AR0++(1), Rl , Rl 

; h(n,N-l) + x(n-(N-l)) * 
tmuerr -> Rl 
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* FILTER AND UPDATE ( 1 <= I < N) 



I I 
* 

* 

I I 
* 

LOOP 



RPTB 



LOOP 



MPYF3 *ARO — (1),AR1 
ADDF3 R0,R2,R2 



ADDF3 R0,R2,R0 
STF R1,*-AR0(1) 



Setup the repeat block. 

Filter: 
RO ; h(n,N-l-i) * x(n-(N-l-i)) -> RO 
Multiply and add operation. 



UPDATE : 

MPYF3 *AR1++(1)%,R4,R1 ;x(n,N- (N-l-i) ) * tmuerr -> Rl 
STF R1,*AR0++(1) ; Rl -> h(n+l ,N-1- ( i-1) ) 

ADDF3 *AR0++(1), Rl 



* RETURN SEQUENCE 
* 

RETS 
* 

* end 
* 

. end 



Rl 
h(n, N-l-i) + x{n- (N-l-i) )*tmuerr -> Rl 

Add last product. 

h(n,0) + x(n) * tmuerr -> h(n+l,0) 



Return 




12.4.3 Matrix-Vector Multiplication 

In matrix -vector multiplication, a K x N matrix of elements m(i,j) having K 
rows and N columns is multiplied by an N x 1 vector to produce a K x 1 result. 
The multiplier vector has elements v(j), and the product vector has elements 
p(i). Each one of the product-vector elements is computed by the expression: 

p(/)= m(/,0) v{0)+ /77(/,1) v(1 ) + ...+ /7?(/,N-1) v(N-1) / = 0,1 K-1 

This is essentially a dot product, and the matrix-vector multiplication contains 
as a special case the dot product presented in Example 12-2. In pseudo-C 
format, the computation of the matrix multiplication is expressed by: 

for (/=0;/<K;/+ + ) { 

Pin = 

for (/=0;/<N;j + -i-) 

p(/) =p(/) + m{i,jy v(j) 
} 

Figure 12-4 shows the data memory organization for matrix-vector multipli- 
cation, and Example 12-34 is the TMS320C30 assembly code to implement 
it. Note that in Example 12-34, K (number of rows) should be greater than 
and N (number of columns) should be greater than 1. 
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low 
address 



high 
address 



matrix storage 




input 
vector storage 




result 
vector storage 


m(0, 0) 




v(0) 




plO) 


m(0, 1) 


v(1) 


pdl 


• 
• 
e 




• 
• 
• 




• 
• 
• 


ni(0, N-1) 




v(N-1) 


m(1, 0) 




P(K-1| 


mil ,1) 







Figure 12-4. Data Memory Organization for Matrix-Vector Multiplication 
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Example 12-34. Matrix times a vector multiplication 

* 

* TITL MATRIX TIMES A VECTOR MULTIPLICATION 

* 

* SUBROUTINE MAT 
* 

* MAT == MATRIX TIMES A VECTOR OPERATION 
* 

* 

* TYPICAL CALLING SEQUENCE: 



load 


ARO 


load 


ARl 


load 


AR2 


load 


AR3 


load 


Rl 


CALL 


MAT 



ARGUMENT ASSIGNMENTS: 
argument | FUNCTION 

+ 



ARO 
ARl 
AR2 
AR3 
Rl 



ADDRESS OF M(0,0) 

ADDRESS OF V(0) 

ADDRESS OF P(0) 

NUMBER OF ROWS - 1 (K-1) 

NUMBER OF COLUMNS - 2 (N-2) 




REGISTERS USED AS INPUT: ARO, ARl, AR2 , AR3 , Rl 
REGISTERS MODIFIED: RO, R2 , ARO, ARl, AR2 , AR3 , IRO, 
RC, RSA, REA 



* PROGRAM SIZE: 11 
* 

* EXECUTION CYCLES: 6+10*K+K*(N-l) 



.global MAT 
* 

* SETUP 
* 

MAT LDI R1,IR0 ; number of columns-2 -> IRO 

ADDI 2, IRO ; IRO = N 

* 

* FOR (i = 0; i < K; i++) LOOP OVER THE ROWS. 
* 

ROWS LDF 0.0, R2 ; initialize R2 

MPYF3 *AR0++(1) ,*AR1++(1) ,R0 

* ; m(i,0) * v(0) -> RO 
* 

* FOR (j = 1; j < N; j++) DO DOT PRODUCT OVER COLUMNS 
* 

RPTS Rl ; multiply a row by a column. 

* 

MPYF3 *AR0++(1) ,*AR1++(1) ,R0 ;m(i,j) * v(j) -> RO 
I I ADDF3 R0,R2,R2 ; m(i,j-l) * v(j-l) + R2 -> R2 , 
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DBD AR3,R0WS ; counts the number of rows left, 

ADDF R0,R2 ; last accumulate. 

STF R2,*AR2++(1) ; result -> p(i) 

NOP * — ARl(IRO) ; set ARl to point to v(0). 
! ! ! DELAYED BRANCH HAPPENS HERE ! ! ! 

RETURN SEQUENCE 

RETS ; return 
end 

.end 



12.4.4 Fast Fourier Transforms (FFT) 

Fourier transforms are an important tool often used in digital signal processing 
systems. The purpose of the transform is to convert information from the time 
domain to the frequency domain. The inverse Fourier transform converts in- 
formation back to the time domain from the frequency domain. Implementa- 
tion of Fourier transforms that are computationally efficient are known as Fast 
Fourier Transforms (FFTs). The theory of FFTs can by found in books such 
as "DFT/FFT and Convolution Algorithms" by C.S. Burrus and T.W. Parks 
(John Wiley, 1985), and in the book "Digital Signal Processing Applications 
with the TMS320 Family". 

The TMS320C30 has many features that permit very efficient implementation 
of numerically intensive algorithms. Some of these features are particularly 
well suited for FFTs. The high speed of the device (60 ns cycle-time) makes 
easier the implementation of real-time algorithms, while the floating-point 
capability eliminates the problems associated with dynamic range. The pow- 
erful indexing scheme in indirect addressing facilitates the access of FFT but- 
terfly legs that have different spans. A construct that reduces the looping 
overhead in algorithms heavily dependent on loops (such as the FFTs), is the 
repeat block implemented by the RPTB instruction. This construct gives the 
efficiency of in-line coding, but has the form of a loop. Since the output of 
the FFT is in scrambled (bit- reversed) order when the input is in regular order, 
there is a need to restore it in the proper order. In the TMS320C30, there is 
no need to spend extra cycles for this rearrangement. The device has a special 
form of indirect addressing (bit-reversed addressing mode), that can be used 
when the FFT output is needed. This mode permits accessing the FFT output 
in the proper order. 

Fast Fourier Transform is a label for a collection of algorithms implementing 
efficient conversion from time to frequency domain. Different types of FFT 
are: 

O Radix-2 and radix-4 algorithms(depending on the size of the FFT but- 
terfly) 

O Decimation in time or frequency (DIT or DIF) 

O Complex or real FFTs 

O FFTs of different lengths, etc. 

12-63 




Software Applications - Application-Oriented Operations 



The present implementation of the FFT was based on programs contained in 
the book "DFT/FFT and Convolution Algorithms" by C.S. Burrus and T.W. 
Parks, and in the paper "Real-Valued Fast Fourier Transform Algorithms" by 
H.V. Sorensen et al. (IEEE Trans, on ASSP, June 1 987). 

Examples 12-35 and 12-36 show the implementation of a complex radix-2, 
DIF, FFT on the TMS320C30. Example 12-35 contains the generic code of 
the FFT that can be used with any length number. However, for the complete 
implementation of an FFT, a table of twiddle factors (sines/cosines) is needed, 
and this table depends on the size of the transform. To retain the generic form 
of Example 12-35, the table with the twiddle factors (containing 11/4 com- 
plete cycles of a sine) is presented separately in Example 12-36 for the case 
of a 64-point FFT. A full cycle of a sine should have a number of points equal 
to the FFT size. In Example 12-36, the FFT length N and M, which is equal 
to the logarithm of N to base equal to the radix, are defined. M is the number 
of stages of the FFT. For a 64-point FFT, M=6 when using a radix-2 algo- 
rithm or M=3 when using a radix-4 algorithm. If the table with the twiddle 
factors and the FFT code are kept in separate files, they should be connected 
at link time. 
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Example 12-35. Complex, Radix-2, DIF FFT 

* 

* TITL COMPLEX, RADIX-2, DIF FFT 
* 

* GENERIC PROGRAM FOR A LOOPED-CODE RADIX-2 FFT COMPUTATION IN 320C30 
* 

* THE PROGRAM IS TAKEN FROM THE BURRUS AND PARKS BOOK, P. 111. 

* THE (COMPLEX) DATA RESIDE IN INTERNAL MEMORY. THE COMPUTATION 

* IS DONE IN-PLACE, BUT THE RESULT IS MOVED TO ANOTHER MEMORY 

* SECTION TO DEMONSTRATE THE BIT-REVERSED ADDRESSING. 
* 

* THE TWIDDLE FACTORS ARE SUPPLIED IN A TABLE PUT IN A .DATA SECTION. 

* THIS DATA IS INCLUDED IN A SEPARATE FILE TO PRESERVE THE GENERIC 

* NATURE OF THE PROGRAM. FOR THE SAME PURPOSE, THE SIZE OF THE FFT 

* N AND L0G2(N) ARE DEFINED IN A .GLOBL DIRECTIVE AND SPECIFIED 

* DURING LINKING. 



INP 



.globl FFT 

. globl N 

.globl M 

.globl SINE 

.usect "IN", 1024 

.BSS OUTP,1024 

.text 



Entry point for execution 

FFT size 

L0G2(N) 

Address of sine table 

Memory with input data 
Memory with output data 



INITIALIZE 



FFTSIZ 


.word 


N 


LOGFFT 


.word 


M 


SINTAB 


.word 


SINE 


INPUT 


• word 


INP 


OUTPUT 


.word 


OUTP 


FFT: 


LDP 


FFTSIZ 




LDI 


@FFTSIZ,IR1 




LSH 


-2,IR1 




LDI 


0,AR6 




LDI 


@FFTSIZ,IRO 




LSH 


1,IR0 




LDI 


(aFFTSIZ,R7 




LDI 


1,AR7 




LDI 


1,AR5 


* OUTER LOOP 




LOOP: 


NOP 


*++AR6(l) 




LDI 


§INPUT,ARO 




ADDI 


R7,AR0,AR2 




LDI 


ART , RC 




SUBI 


1,RC 



Command to load. data page pointer 



IRl=N/4, pointer for SIN/COS table 
AR6 holds the current stage number 

IR0=2*N1 (because of real/imag) 

R7=N2 

Initialize repeat counter of first loop 

Initialize IE index (AR5=IE) 



Current FFT stage 
ARO points to X(I) 
AR2 points to X(L) 

RC should be one less than desired # 
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* FIRST LOOP 








RPTB 


BLKl 






ADDF 


*AR0,*AR2,R0 


RO=X(I)+X(L) 




SUBF 


*AR2++,*AR0++,R-L R1=X(I)-X(L) 




ADDF 


*AR2,*AR0,R2 


R2=Y(I)+Y(L) 




SUBF 


*AR2,*AR0,R3 


R3=Y(I)-Y(L) 




STF 


R2,*AR0-- 


Y(I)=R2 and... 


1 1 


STF 


R3,*AR2~ 


Y{L)=R3 


BLKl 


STF 


RO,*ARO++(IRO) 


X(I)=RO and... 


II 


STF 


R1,*AR2++(IR0) 


X(L)=R1 and AR0,2 = AR0,2 + 2*n 


* IF 


THIS IS 


THE LAST STAGE, YOU ARE DONE 




CMP I 


(aL0GFFT,AR6 






BZD 


END 




* MAIN INNER 


LOOP 






LDI 


2,AR1 


Init loop counter for inner loop 




LDI 


(asiNTAB,AR4 


Initialize lA index (AR4=IA) 


INLOP 


ADD I 


AR5,AR4 


IA=IA+IE; AR4 points to cosine 




LDI 


AR1,AR0 






ADD I 


2,AR1 


Increment inner loop counter 




ADD I 


@INPUT,ARO 


(X(I) ,Y(I)) pointer 




ADD I 


R7,AR0,AR2 


(X(L) ,Y{L) ) pointer 




LDI 


AR7 , RC 






SUBI 


1,RC 


RC should be one less than desired # 




LDF 


*AR4,R6 


R6=SIN 


* SECOND LOOI 


) 






RPTB 


BLK2 






SUBF 


*AR2,*AR0,R2 


R2=X(I)-X(L) 




SUBF 


*+AR2,*+AR0,Rl 




* 






R1=Y(I)-Y{L) 




MPYF 


R2,R6,R0 


R0=R2*SIN and. . . 


1 1 


ADDF 


*+AR2,*+AR0,R3 




* 






R3=Y(I)+Y(L) 




MPYF 


R1,*+AR4(IR1) ,R 


3 ;R3 = Rl * COS and . . . 


1 1 


STF 


R3,*+AR0 


Y(I)=Y(I)+Y(L) 




SUBF 


R0,R3,R4 


R4=R1*C0S-R2*SIN 




MPYF 


R1,R6,R0 


R0=R1*SIN and. . . 


1 1 


ADDF 


*AR2,*AR0,R3 


R3=X(I)+X(L) 




MPYF 


R2,*+AR4(IR1) ,R 


3 ; R3 = R2 * COS and . . . 


1 1 


STF 


R3,*AR0++(IR0) 




* 






• X(I)=X(I)+X(L) and AR0=AR0+2*N1 




ADDF 


R0,R3,R5 


R5=R2*C0S+R1*SIN 


BLK2 


STF 


R5,*AR2++(IR0) 


; X(L)=R2*C0S+R1*SIN, incr AR2 and... 


II 


STF 


R4,*+AR2 


, Y(L)=R1*C0S-R2*SIN 




CMP I 


R7,AR1 






BNE 


INLOP 


Loop back to the inner loop 



LSH 



1,AR7 



Increment loop counter for next time 
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BRDP 


LOOPS 






LSH 


1,AR5 






LDI 


R7,IR0 






LSH 


-1,R7 




* STORE RESULT 


OUT USING BIT-R 


END: 


LDI 


@FFTSIZ,RC 




SUBI 


1,RC 






LDI 


§FFTSIZ,IRO 






LDI 


2,IR1 




LDI 


@INPUT,ARO 




LDI 


@0UTPUT,AR1 




RPTB 


BITRV 




LDF 


*+ARO{l) ,R0 


1 1 


LDF 


*AR0++(IR0)B,R1 


BITRV 


STF 


R0,*+AR1(1) 


II 


STF 


R1,*AR1++(IR1) 


SELF 


BR 


SELF 




.end 







Next FFT stage (delayed) 

IE=2*IE 

N1=N2 

N2=N2/2 



RC=N 

RC should be one less than desired # 

IRO=size of FFT=N 



Branch to itself at the end 
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Example 12-36. Table With Twiddle Factors For A 64-Point FFT 

* 

*TITL TABLE WITH TWIDDLE FACTORS FOR A 64-POINT FFT 
* 

* FILE TO BE LINKED WITH THE SOURCE CODE FOR A 64-POINT, RADIX-2 FFT, 



SINE 




.globl 
.globl 
.globl 

.set 
. set 

.data 



SINE 

N 

M 

64 
6 



.float 


0. 


,000000 


.float 


0. 


,098017 


.float 


0. 


,195090 


.float 


0. 


,290285 


.float 


0, 


,382683 


.float 


0. 


,471397 


.float 


0, 


,555570 


.float 


0. 


,634393 


.float 


0. 


,707107 


.float 


0. 


,773010 


.float 


0, 


,831470 


.float 


0, 


.881921 


.float 


0, 


.923880 


.float 


0, 


.956940 


.float 


0, 


.980785 


.float 


0, 


.995185 


COSINE 






.float 


1, 


.000000 


.float 


0, 


.995185 


.float 


0, 


.980785 


.float 


0, 


.956940 


.float 


0, 


.923880 


.float 


0, 


.881921 


.float 


0. 


.831470 


.float 


0, 


.773010 


.float 


0, 


.707107 


.float 


0, 


.634393 


.float 


0, 


.555570 


.float 


0, 


.471397 


.float 


0, 


.382683 


.float 


0, 


.290285 


.float 


0, 


.195090 


.float 


0, 


.098017 


.float 


0, 


.000000 


.float 


-0, 


.098017 


.float 


-0, 


,195090 


.float 


-0, 


.290285 


.float 


-0. 


,382683 


.float 


-0, 


.471397 


.float 


-0, 


.555570 


.float 


-0, 


.634393 


.float 


-0, 


.707107 


.float 


-0, 


.773010 
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.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
■float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 
.float 



-0.831470 

-0.881921 

-0.923880 

-0.956940 

-0.980785 

-0.995185 

-1.000000 

-0.995185 

-0.980785 

-0.956940 

-0.923880 

-0.881921 

-0.831470 

-0.773010 

-0.707107 

-0.634393 

-0.555570 

-0.471397 

-0.382683 

-0.290285 

-0.195090 

-0.098017 

0.000000 

0.098017 

0.195090 

0.290285 

0.382683 

0.471397 

0.555570 

0.634393 

0.707107 

0.773010 

0.831470 

0.881921 

0.923880 

0.956940 

0.980785 

0.995185 



The radix-2 algorithm has a great tutorial value because it is relatively easy to 
understand how the FFT algorithm functions. However, radix-4 implementa- 
tions can increase the speed of the execution by reducing the overall arith- 
metic required. Example 12-37 shows the generic implementation of a 
complex, DIF FFT in radix-4. A companion table, like the one in Example 
1 2-36, should have a value of M equal to the logN, where the base of the lo- 
garithm is four. 
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Example 12-37. Complex, Radix-4, DIF FFT 

* 

* TITL COMPLEX, RADIX-4, DIF FFT 
* 

* GENERIC PROGRAM TO DO A LOOPED-CODE RADIX-4 FFT COMPUTATION IN 

* THE TMS320C30. 
* 

* THE PROGRAM IS TAKEN FROM THE BURRUS AND PARKS BOOK, P. 117. 

* THE (COMPLEX) DATA RESIDE IN INTERNAL MEMORY, AND THE COMPUTATION 

* IS DONE IN-PLACE. 
* 

* THE TWIDDLE FACTORS ARE SUPPLIED IN A TABLE PUT IN A .DATA SECTION. 

* THIS DATA IS INCLUDED IN A SEPARATE FILE TO PRESERVE THE GENERIC 

* NATURE OF THE PROGRAM. FOR THE SAME PURPOSE, THE SIZE OF THE 

* FFT N AND L0G4(N) ARE DEFINED IN A . GLOBL DIRECTIVE AND SPECIFIED 

* DURING LINKING. 
* 

* IN ORDER TO HAVE THE FINAL RESULT IN BIT-REVERSED ORDER, THE TWO 

* MIDDLE BRANCHES OF THE RADIX-4 BUTTERFLY ARE INTERCHANGED DURING 

* STORAGE. NOTE THIS DIFFERENCE WHEN COMPARING WITH THE PROGRAM IN 

* P. 117 OF THE BURRUS AND PARKS BOOK. 



, globl FFT 

, globl N 

, globl M 

globl SINE 



Entry point for execution 

FFT size 

L0G4(N) 

Address of sine table 



.usect "IN" ,INP, 1024 ; Memory with input data 
.text 



INITIALIZE 




TEMP 


• word 


$+2 


STORE 


.word 


FFTSIZ 




.word 


N 




.word 


M 




.word 


SINE 




.word 


INP 




• BSS 


FFTSIZ, 1 




.BSS 


L0GFFT,1 




.BSS 


SINTAB,! 




.BSS 


INPUT, 1 




.BSS 


STAGE , 1 




.BSS 


RPTCNT,1 




.BSS 


IEINDX,1 




.BSS 


LPCNT,1 




.BSS 


JT,1 




.BSS 


IA1,1 



Beginning of temp storage area 



FFT size 

L0G4 (FFTSIZ) 

Sine/cosine table base 

Area with input data to process 

FFT stage # 

Repeat counter 

IE index for sine/cosine 

Second- loop count 

JT counter in program, P. 117 

lAl index in program, P. 117 
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FFT: 



INITIALIZE DATA LOCATIONS 

LDP TEMP 

LDI §TEMP,ARO 

LDI ©STORE, ARl 

LDI *ARO++,RO 

STI R0,*AR1++ 

LDI *ARO++,RO 

STI R0,*AR1++ 

LDI *ARO++,RO 

STI R0,*AR1++ 

LDI *ARO,RO 

STI R0,*AR1 

LDP FFTSIZ 

LDI @FFTSIZ,RO 

LDI @FFTSIZ,IRO 

LDI §FFTSIZ,IR1 

LDI 0,AR7 

STI AR7 , @STAGE 

LSH 1,IR0 

LSH -2,IR1 

LDI 1,AR7 

STI AR7 , @RPTCNT 

LSH -2,R0 

STI AR7,@IEINDX 

ADDI 2,R0 

STI RO,@JT 

SUBI 2,R0 

LSH 1 , RO 



OUTER LOOP 



LOOP: 



LDI 

ADDI 

ADDI 

ADDI 

LDI 

SUBI 

FIRST LOOP 



©INPUT, ARC 
R0,AR0,AR1 
R0,AR1,AR2 
R0,AR2,AR3 
@RPTCNT,RC 
1,RC 



RPTB BLKl 

ADDF * + ARO , * + AR2 , R 1 

ADDF *+AR3,*+ARl,R3 

ADDF R3,R1,R6 

SUBF *+AR2,*+AR0,R4 

STF R6,*+AR0 

SUBF R3,R1 

LDF *AR2,R5 

LDF *+ARl,R7 

ADDF *AR3,*AR1,R3 

ADDF R5,*AR0,R1 

STF R1,*+AR1 



Command to load data page counter 
Xfer data from one memory to the other 



Command to load data page pointer 



©STAGE holds the current stage number 
IR0=2*N1 (because of real/imag) 
IRl=N/4, pointer for SIN/COS table 

Initialize repeat counter of first loop 

Initialize IE index 

JT=R0/2+2 

R0=N2 



ARO points to X(I) 
ARl points to X(I1) 
AR2 points to X(I2) 
AR3 points to X(I3) 

RC should be one less than desired # 



R1=Y(I)+Y(I2) 

R3=Y(I1)+Y{13) 
R6=R1+R3 

R4=Y(I)-Y{12) 

Y(I)=R1+R3 

R1=R1-R3 

R5=X(I2) 

R7=Y(I1) 

R3=X(I1)+X(I3) 

R1=X(I)+X(I2) 

Y(I1)=R1-R3 
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M 

BLKl 
II 



ADDF R3,R1,R6 

SUBF R5,*AR0,R2 

STF R6,*AR0++(IR0) 

SUBF R3,R1 

SUBF *AR3,*AR1,R6 

SUBF R7,*+AR3,R3 

STF R1,*ARH-+{IR0) 

SUBF R6,R4,R5 

ADDF R6,R4 

STF R5,*+AR2 

STF R4,*+AR3 

SUBF R3,R2,R5 

ADDF R3,R2 

STF R5,*AR2++(IR0) 

STF R2,*AR3++(IR0) 



R6=R1+R3 

R2=X(I)-X(I2) 
; X(I)=R1+R3 

R1=R1-R3 

R6=X(I1)-X(I3) 

-R3=Y(I1)-Y(I3) 
;X(I1)=R1-R3 

R5=R4-R6 

R4=R4+R6 

Y(I2)=R4-R6 

Y(I3)=R4+R6 

R5=R2-R3 

R2=R2+R3 
; X(I2)=R2-R3 
; X(I3)=R2+R3 



IF THIS IS THE LAST STAGE, YOU ARE DONE 



LDI 
ADD I 
CMP I 
BZD 
STI 



@STAGE,AR7 

1,AR7 

§L0GFFT,AR7 

END 

AR7 , @STAGE 



MAIN INNER LOOP 



INLOP : 




LDI 
STI 
LDI 
STI 

LDI 
ADD I 
LDI 
LDI 
ADD I 
ADD I 
STI 
ADD I 
STI 
ADD I 
ADD I 
LDI 
SUBI 
CMP I 
BZD 
LDI 
LDI 
ADD I 
ADD I 
SUBI 
ADD I 
SUBI 



1,AR7 

AR7,@IA1 

2,AR7 

AR7 , §LPCNT 

2,AR6 

@LPCNT,AR6 

@LPCNT,ARO 

@IA1,AR7 

@IEINDX,AR7 

§INPUT,ARO 

AR7,@IA1 

R0,AR0,AR1 

AR6 , @LPCNT 

R0,AR1,AR2 

R0,AR2,AR3 

@RPTCNT,RC 

1,RC 

@JT,AR6 

SPCL 

@IA1,AR7 

(aiAl,AR4 

@SINTAB,AR4 

AR4,AR7,AR5 

1,AR5 

AR7,AR5,AR6 

1,AR6 



Current FFT stage 

Init lAl index 

Init loop counter for inner loop 

Increment inner loop counter 



IA1=IA1+IE 

(X(I) ,Y(I) ) pointer 

(X(I1) ,Y(I1) ) pointer 

(X(I2) ,Y(I2) ) pointer 
(X(I3) ,Y(I3) ) pointer 

RC should be one less than desired # 
If LPCNT=JT, go to 
special butterfly 



Create cosine index AR4 

IA2=IA1+IA1-1 

IA3=IA2+IA1-1 
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II 



II 



I I 



RPTB 
ADDF 

ADDF 

ADDF 
SUBF 

SUBF 

ADDF 

ADDF 

MPYF 

STF 

ADDF 

SUBF 

SUBF 

MPYF 

STF 

SUBF 

SUBF 

MPYF 

STF 

MPYF 

ADDF 

ADDF 

SUBF 

SUBF 

SUBF 

ADDF 

MPYF 

STF 

MPYF 

SUBF 

MPYF 

STF 

MPYF 

ADDF 

MPYF 

STF 

MPYF 

SUBF 

MPYF 

STF 

MPYF 

ADDF 



SECOND LOOP 

BLK2 
*+AR2,*+AR0,R3 

*+AR3,*+ARl,R5 

R5,R3,R6 ; 
*+AR2,*+AR0,R4 

R5,R3 ; 

*AR2,*AR0,R1 ; 
*AR3,*AR1,R5 



R3=Y(I)+Y(I2) 

R5=Y(I1)+Y(I3) 
R6=R3+R5 

R4=Y(I)-Y(I2) 
R3=R3-R5 
R1=X(I)+X(I2) 
R5=X(I1)+X(I3) 



R3,*+AR5(IR1) ,R6 R6=R3*C02 



R6,*+AR0 

R5,R1,R7 

*AR2 , *ARO , R2 ; 

R5,R1 ; 

R1,*AR5,R7 ; 

R7,*AR0++(IR0) 

R7,R6 

*+AR3,*+ARl,R5 

; R5=Y(I1)-Y(I3) 
R1,*+AR5(IR1) ,R7 ;R7=R1*C02 



Y(I)=R3+R5 
R7=R1+R5 
R2=X(I)-X(I2) 
R1=R1-R5 
R7=R1*SI2 
; X(I)=R1+R5 
R6=R3*C02-R1*SI2 



R6,*+AR1 

R3,*AR5,R6 

R7,R6 

R5,R2,R1 

R5,R2 

*AR3,*AR1,R5 

R5,R4,R3 

R5,R4 

R3,*+AR4(IR1) ,R6 

R6,*AR1++(IR0) ; 



Y(I1)=R3*C02-R1*SI2 

R6=R3*SI2 

R6=R1*C02+R3*SI2 

R1=R2+R5 

R2=R2-R5 

R5=X(I1)-X(I3) 

R3=R4-R5 

R4=R4+R5 

R6=R3*C01 
X(I1)=R1*C02+R3*SI2 



R1,*AR4,R7 ; R7=R1*SI1 

R7,R6 ; R6=R3*C01-R1*SI1 

R1,*+AR4(IR1) ,R6 ; R6=R1*C01 

R6,*+AR2 ; Y ( 12 ) =R3*C01-R1*SI1 

R3,*AR4,R7 ; R7=R3*SI1 

R7,R6 ; R6=R1*C0H-R3*SI1 

R4,*+AR6(IR1) ,R6 ; R6=R4*C03 

R6,*AR2++(IR0) ; X( 12) =R1*C01+R3*SI1 

R2,*AR6,R7 ; R7=R2*SI3 

R7,R6 ; R6=R4*C03-R2*SI3 

R2,*+AR6(IR1) ,R6 ; R6=R2*C03 

R6,*+AR3 ; y { ±3 ) =R4*C03-R2*SI3 

R4,*AR6,R7 ; R7=R4*SI3 

R7,R6 ; R6=R2*C03+R4*SI3 
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BLK2 



STF R6,*AR3++(IR0) 



CMP I @LPCNT,RO 
BP INLOP 
BR CONT 

SPECIAL BUTTERFLY FOR W=J 



; x(i3)=R2*C03+R4*SI3 

; LOOP BACK TO THE INNER LOOP 




SPCL 


LDI 


IR1,AR4 






LSH 


-1,AR4 


Point to SIN (45) 




ADD I 


@SINTAB,AR4 


; Create cosine index 1 




RPTB 


BLK3 






ADDF 


*AR2,*AR0,R1 


; R1=X(I)+X(I2) 




SUBF 


*AR2,*AR0,R2 


R2=X(I)-X(I2) 




ADDF 


*+ar2,*+aro,r: 


! 


* 






R3=Y(I)+Y(I2) 




SUBF 


*+AR2,*+ARO,R4 


I 


* 






• R4=Y(I)-Y(I2) 




ADDF 


*AR3,*AR1,R5 


R5=X(I1)+X(I3) 




SUBF 


R1,R5,R6 


, R6=R5-R1 




ADDF 


R5,R1 


R1=RH-R5 




ADDF 


*+AR3,*+ARl,R5 




* 






R5=Y{I1)+Y(I3) 




SUBF 


R5,R3,R7 


R7=R3-R5 




ADDF 


R5,R3 


R3=R3+R5 




STF 


R3,*+AR0 


Y(I)=R3+R5 


1 1 


STF 


R1,*AR0++(IR0) 


; X(I)=RH-R5 




SUBF 


*AR3,*AR1,R1 


R1=X(I1)-X(I3) 




SUBF 


*+ar3,*+ari,r: 




* 






R3=Y(I1)-Y(I3) 




STF 


R6,*+AR1 


Y(I1)=R5-R1 


1 1 


STF 


R7,*AR1++(IR0) 


; X(I1)=R3-R5 




ADDF 


R3,R2,R5 


R5=R2+R3 




SUBF 


R2,R3,R2 


R2=-R2+R3 




SUBF 


R1,R4,R3 


R3=R4-R1 




ADDF 


R1,R4 


R4=R4+R1 




SUBF 


R5,R3,R1 


R1=R3-R5 




MPYF 


*AR4,R1 


R1=R1*C021 




ADDF 


R5,R3 


R3=R3+R5 




MPYF 


*R4,R3 


R3=R3*C021 


II 


STF 


R1,*+AR2 


Y(I2)=(R3-R5)*C021 




SUBF 


R4,R2,R1 


R1=R2-R4 




MPYF 


*AR4,R1 


R1--R1*C021 


II 


STF 


R3,*AR2++(IR0) 


; X(I2)=(R3+R5)*C021 




ADDF 


R4,R2 


R2=R2+R4 




MPYF 


*AR4 , R2 


R2=R2*C021 


BLK3 


STF 


R1,*+AR3 


Y(I3)=-(R4-R2)*C021 


II 


STF 


R2,*AR3++(IR0) 


; X(I3)=(R4+R2)*C021 




CMP I 


§LPCNT,RO 






BPD 


INLOP 


Loop back to the inn« 
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CONT 



LDI 
LDI 
LSH 

STI 

LSH 

STI 

LDI 

LSH 

ADD I 

STI 

SUBI 

LSH 

BR 



@RPTCNT,AR7 
@IEINDX,AR6 
2,AR7 

AR7 , @RPTCNT 

2,AR6 

AR6,@IEINDX 

RO,IRO 

-3,R0 

2,R0 

RO,@JT 

2,R0 

1,R0 

LOOP 



Increment repeat counter for 
next time 

IE=4*IE 

N1=N2 



JT=N2/2+2 

N2=N2/4 

Next FFT stage 



STORE RESULT OUT USING BIT-REVERSED ADDRESSING 



END: LDI 
SUBI 
LDI 
LDI 
LDI 
LDP 
LDI 

RPTB 

LDF 

I I LDF 

BITRV STF 

I I STF 



SELF 



BR 
. end 



§FFTSIZ,RC 

1,RC 

§FFTSIZ,IRO 

2,IR1 

©INPUT, ARO 

STORE 

(aSTORE,ARl 



BITRV 

*+ARO(l) ,R0 
*AR0++(IR0)<,R1 
R0,*+AR1(1) 
R1,*ARH-+(IR1) 

SELF 



RC=N 

RC should be one less than desired # 

IRO=size of FFT=N 



Branch to itself at the end. 



Most often, the data to be transformed is a sequence of real numbers. In this 
case, the FFT demonstrates certain symmetries that permit the reduction of the 
computational load even further. Example 12-38 shows the generic imple- 
mentation of a real-valued, radix-2 FFT. For such an FFT, the total number 
of storage required for a length- N transform is only N locations instead of 2N 
that are necessary in a complex FFT. The rest of the points can be recovered 
based on the symmetry conditions. 
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Example 12-38. Real, Radlx-2 FFT 

* 

* TITL REAL, RADIX-2 FFT 
* 

* GENERIC PROGRAM TO DO A RADIX-2 REAL FFT COMPUTATION IN 320C30. 
* 

* THE PROGRAM IS TAKEN FROM THE PAPER BY SORENSEN ET AL., JUNE 1987 

* ISSUE OF THE TRANSACTIONS ON ASSP . 
* 

* THE REAL DATA RESIDE IN INTERNAL MEMORY. THE COMPUTATION IS 

* DONE IN-PLACE. THE BIT-REVERSAL IS DONE AT THE BEGINNING OF 

* THE PROGRAM. 
* 

* THE TWIDDLE FACTORS ARE SUPPLIED IN A TABLE PUT IN A .DATA 

* SECTION. THIS DATA IS INCLUDED IN A SEPARATE FILE TO PRESERVE 

* THE GENERIC NATURE OF THE PROGRAM. FOR THE SAME PURPOSE, THE 

* SIZE OF THE FFT N AND L0G2(N) ARE DEFINED IN A .GLOBL DIRECTIVE 

* AND SPECIFIED DURING LINKING. THE LENGTH OF THE TABLE IS 

* N/4 + N/4 = N/2. 






.globl 
.globl 
.globl 
.globl 


FFT 
N 
M 
SINE 




.bss 


INP,1024 




.text 




* INITIALIZE 




FFTSIZ 


.word 


N 


LOGFFT 


.word 


M 


SINTAB 


.word 


SINE 


INPUT 


.word 


INP 


FFT: 


LDP 


FFTSIZ 


* DO THE BIT-REVERSING AT 




LDI 


@FFTSIZ,RC 




SUBI 


1,RC 




LDI 


©FFTSIZ, IRO 




LSH 


-1,IR0 




LDI 


@INPUT,ARO 




LDI 


@INPUT,AR1 




RPTB 


BITRV 




CMP I 


AR1,AR0 




BGE 


CONT 




LDF 


*ARO,RO 


II 


LDF 


*AR1,R1 




STF 


R0,*AR1 


1 1 


STF 


R1,*AR0 



Entry point for execution 

FFT size 

L0G2(N) 

Address of sine table 

Memory with input data 



Command to load data page printer 



RC=N 

RC should be one less than desired # 

IRO=half the size of FFT=N/2 



Exchange locations only 
if AR0<AR1 
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CONT 


NOP 


*ARO++ 




BITRV 


NOP 


*AR1++(IR0)B 




* LENGTH-TWO 


BUTTERFLIES 






LDI 


@ INPUT, ARO 


; ARO points to X(I) 




LDI 


IRO,RC 


; Repeat N/2 times 




SUBI 


1,RC 


; RC should be one less than desired # 




RPTB 


BLKl 






ADDF 


*+ARO,*ARO++,RO 


* 






R0=X(I)H-X(I+1) 




SUBF 


*AR0,*-AR0,R1 




* 






• R1=X(I)-X(I+1) 


BLKl 


STF 


RO,*-ARO 


• X(I)=X(I)+X(I+1) 


II 


STF 


R1,*AR0++ 


• X(H-1)=X(I)-X(I+1) 


* FIRST PASS 


OF THE DO- 20 LOOP (STAGE K=2 IN DO- 10 LOOP) 




LDI 


§INPUT,ARO 


ARO points to X(I) 




LDI 


2,IR0 


IR0=2=N2 




LDI 


QFFISIZ ,RC 






LSH 


-2,RC 


Repeat N/4 times 




SUBI 


1,RC 


RC should be one less than desired # 




RPTB 


BLK2 






ADDF 


*+ARO(IRO) ,*AR0++(IR0) ,R0 


* 






R0=X(I)+X(I+2) 




SUBF 


*aro,*-aro{ir6) ,R1 


* 






Rl=X(I)-X(I+2) 




NEGF 


*+ARO,RO 


R0=-X(I+3) 


1 1 


STF 


R0,*-AR0(IR0) 


X(I)=X(I)+X(I+2) 


BLK2 


STF 


Rl,*ARO++(IRO) 




* 






X(I+2)=X(I)-X(I+2) 


II 


STF 


RO,*+ARO 


X(I+3)=-X(I+3) 


* MAIN 


LOOP 


(FFT STAGES) 






LDI 


@FFTSIZ,IRO 






LSH 


-2,IR0 


IRO=index for E 




LDI 


3,R5 


R5 holds the current stage number 




LDI 


1,R4 


R4=N4 




LDI 


2,R3 


R3=N2 


LOOP 


LSH 


-1,IR0 


E=E/2 




LSH 


1,R4 


N4=2*N4 




LSH 


1,R3 


N2=2*N2 


* INNER LOOP 


(DO-20 LOOP IN ' 


CHE PROGRAM) 




LDI 


§INPUT,AR5 


AR5 points to X(I) 


INLOP 


LDI 


IRO,ARO 






ADDI 


@SINTAB,ARO 


ARO points to SIN/COS table 




LDI 


R4,IR1 


IR1=N4 




LDI 


AR5,AR1 






ADDI 


1,AR1 


ARl points to X(I1)=X(I+J) 




LDI 


AR1,AR3 
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ADD I 
LDI 
SUBI 
ADD I 


■k 


LDF 


•It 


ADDF 




SUBF 


II 


STF 

NEGF 

NEGF 


II 


STF 
STF 



R3,AR3 ; AR3 points to X( 13 ) =X( I+J+N2) 

AR3,AR2 

2,AR2 ; AR2 points to X( 12) =X( I-J+N2) 

R3,AR2,AR4 ; AR4 points to X ( 14 )=X(I-J+N1) 

*AR5++(IR1) ,R0 

; RO=X(I) 
*+AR5(IRl) ,R0,R1 

; R1=X(I)+X(I+N2) 
R0,*++AR5(IR1) ,R0 

; R0=-X(I)+X(I+N2) 

R1,*-AR5(IR1) ; X ( I ) =X ( I ) -HX ( H-N2 ) 

RO ; R0=X(I)-X(I+N2) 

*++AR5(IRl) ,R1 

; R1=-X(I+N4+N2) 

R0,*AR5 ; X( I+N2) =X(I ) -X( I+N2) 

R1,*AR5 ; X(I+N4+N2)=-X(I+N4+N2) 



* INNERMOST LOOP 



II 



BLK3 




END 



LDI 


§FFTSIZ,IR1 




LSH 


-2,IR1 


; 


LDI 


R4,RC 




SUBI 


2,RC 


; 


RPTB 


BLK3 




MPYF 


*AR3,*+AR0(IR1) 


MPYF 


*AR4,*AR0,R1 


' 


MPYF 


*AR4,*+AR0(IR1) 


ADDF 


R0,R1,R2 


t 


MPYF 


*AR3,*AR0++{IR0 


SUBF 


R0,R1,R0 


t 


SUBF 


*AR2,R0,R1 




ADDF 


*AR2,R0,R1 




STF 
ADDF 


R1,*AR3++ 
*AR1,R2,R1 




STF 


R1,*AR4 — 




SUBF 


R2,*AR1,R1 




STF 


R1,*AR1++ 




STF 


R1,*AR2 — 




SUBI 


©INPUT, AR5 




ADD I 


R3,AR5 


• 


CMP I 


@FFTSIZ,AR5 




BLED 


INLOP 


• 


ADD I 


@INPUT,AR5 




NOP 






NOP 






ADD I 


1,R5 




CMP I 


@L0GFFT,R5 




BLE 


LOOP 




BR 


END 


. 


.end 







IRl=separation between SIN/COS tbls 
Repeat N4-1 times 



,R0 

RO=X(I3)*COS 

R1=X(I4)*SIN 
,R1 ; R1=X(I4)*C0S 

R2=X ( 13 ) *COS+X ( 14 ) *SIN 
),R0 

R0=X(I3)*SIN 

RO=-X ( 13 ) *SIN+X ( 14 ) *COS 

R1=-X(I2)+R0 

R1=X(I2)+R0 

X(I3)=-X(I2)+R0 

R1=X(I1)+R2 

X(I4)=X(I2)+R0 

R1=X(I1)-R2 

X(I1)=X(I1)+R2 

X(I2)=X{I1)-R2 



AR5 = i-l-Nl 

Loop back to the inner loop 



Branch to itself at the end, 
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Table 12-1 summarizes the execution time required for FFT lengths between 
64 and 1024 points for the three algorithms in Examples 12-35, 12-37, and 
12-38. As can be seen, the TMS320C30 permits very fast execution of such 
transforms. FFT lengths up to 1024 points (complex) or 2048 points (real), 
covering the majority of applications, can be executed almost entirely in the 
on-chip memory. 

Table 12-1. TMS320C30 FFT Timing Benchmarks 



NUMBER 




FFT TIMING 




OF 




(in milliseconds) 




POINTS 


RADIX-2 


RADIX-4 


RADIX-2 




(complex) 


(complex) 


(real) 


64 


0.167 


0.123 


0.075 


128 


0.367 


- 


0.162 


256 


0.801 


0.624 


0.354 


512 


1.740 


- 


0.771 


1024 


3.750 


3.040 


1.670 



12.4.5 Lattice Filters 



The lattice form is an alternative way of implementing digital filters, and it has 
found applications in speech processing, spectral estimation, and other areas. 
In the present discussion, the notation and the terminology from speech pro- 
cessing applications will be used. 

If H(z) is the transfer function of a digital filter that has only poles, A(z) = 
1/H(z) will be a filter having only zeros and it will be called the inverse filter. 
The inverse lattice filter is shown in Figure 1 2-5. In mathematical terms, it is 
described by the equations: 

f{hn) = /(/-I ,n) + k{i) b{i-^ ,n-^ ) 
b^n) = b{i-^,n-^)+ k{i) f(/-1,/7) 

Initial conditions: 

f{0,n) = b{0,n) = x{n) 
Final conditions: 
y{n) = f{p,n). 



f{i,n) is called the forward error, b{/',n) backward error, k{/) is the /-th re- 
flection coefficient, x{n) is the input, and y{n) the output signal. The order 
of the filter (i.e., the number of stages) is p. In the linear predictive coding 
(LPC) method of speech processing, the inverse lattice filter is used during 
analysis, and the (forward) lattice filter during speech synthesis. 
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x(n) f(1, n) 



f(2, n) 



f(p-1,n) 



f(p. n)=y|n) 



b(1, n) 



K1' 
K1/ 



b(2, n) 



K2' 
K2, 



b(p-l, n) 



Figure 12-5. Structure of the Inverse Lattice Filter 



Figure 12-6 shows the data memory organization of the inverse lattice-filter 
on the TMS320C30. 



low 
address 


reflection 
coefficients 




backward 
propagation terms 


k(1) 




b(0, n-1) 




k(2) 


b(1,n-1) 




• 
• 
• 




• 
• 

e 


high 
address 


kip) 


b(p-1, n-1) 



Figure 12-6. Data Memory Organization for Lattice Filters 
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Example 12-39. inverse Lattice Filter 

* TITL INVERSE LATTICE FILTER 
* 

* 

* SUBROUTINE L A T I N V 
* 

* LATINV == LATTICE FILTER (LPC INVERSE FILTER - ANALYSIS) 

* 

* TYPICAL CALLING SEQUENCE: 
* 

* load R2 

* load ARO 

* load ARl 

* load RC 

* CALL LATINV 



ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 

+ 



R2 

ARO 

ARl 

RC 



f(0,n) = x(n) 

ADDRESS OF FILTER COEFFICIENTS k(l)) 

ADDRESS OF BACKWARD PROPAGATION 

VALUES (b(0,n-l)) 
RC = p - 2 



REGISTERS USED AS INPUT: R2 , ARO, ARl, RC 

REGISTERS MODIFIED: RO, Rl, R2 , R3 , RS, RE, RC, ARO, ARl 

REGISTER CONTAINING RESULT: R2 (f(p,n)) 



PROGRAM SIZE: 10 WORDS 
EXECUTION CYCLES: 13 + 3 * (p-1) 



.global LATINV 



LATINV MPYF3 *ARO , *ARl, RO 

k{l) * b(0,n-l) -> RO 



LDF R2,R3 



Assume f(0,n) -> R2 . 

Put b{0,n) = f (0,n) -> R3 



MPYF3 *AR0++(1) ,R2,R1 

; k(l) * f(0,n) -> Rl 
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2 <= i <= p 

RPTB 

MPyF3 

ADDF3 



LOOP 
*AR0,*++AR1(1) ,R0 



k(i) * b(i-l,n-l) -> RO 



R2,R0,R2 



f (i-l-l,n)+k(i-l) 
*b(i-l-l,n-l) 
= f(i-l,n) -> R2 



I I 
* 

LOOP 



ADDF3 
STF 



; b(i-l-l,b-l)+k(i-l)*f (i-l-l,n) 
*-ARl(l), Rl, R3 ; = b(i-l,n) -> R3 
R3, *-ARl(l) ; b(i-l-l,n) -> b(i-l-l,n-l) 



MPYF3 *AR0++(1) ,R2,R1 



k(i) 



f(i-l,n) -> Rl 



I = P+1 (CLEANUP) 

ADDF3 R2,R0,R2 

ADDF35 ARl, Rl, R3 
STF R3, *AR1 

RETURN SEQUENCE 

RETS 

end 

.end 



f (p-l,n)+k{p)*b(p-l,n-l) 
= f(p,n) -> R2 

b(p-l,n-l)+k(p)*f (p-l,n) 

= b(p,n) -> R3 
b(p-l,n) -> b(p-l,n-l) 



RETURN 




The (forward) lattice filter has a structure very similar to the inverse filter, as 
shown in Figure 12-7. The corresponding equations describing the lattice 
filter are: 

f(/-1,/7) = f{i,n) - k{i) b{/-^,n-^) 
b{hn) = b{i-^,n-^)+ k{i) f{ i-^,n) 

Initial conditions: 

f{p,n) = x{ n), b{i,n-^ ) = for /=1 ,...,p 
Final conditions: 

y{n) = f{0,n). 

The data memory organization is identical to the one of the inverse filter, as 
shown in Figure 12-6. Example 12-40 is the implementation of the lattice 
filter on the TMS320C30. 
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x(n) = f(p, n) 



f(2,n) 



f(1,n) 



V(n) 



-Kp^ 
Kp 



-K2' 
K2, 



b(2.n) 



K1 



(b(1, n) 



Figure 12-7. Structure of the (Forward) Lattice Filter 
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Example 12-40. Lattice Filter 

* TITL LATTICE FILTER 
* 

* 

* SUBROUTINE L A T I C E 
* 

* LOAD ARO 

* LOAD ARl 

* LOAD RC 

* CALL L AT ICE 



ARGUMENT ASSIGNMENTS: 

ARGUMENT | FUNCTIONf unction 
+ 

R2 |F(P,N)=E(N)= EXCITATION 

ARO I ADDRESS OF FILTER COEFFICIENTS (K(P)) 

ARl I ADDRESS OF BACKWARD PROPAGATION VALUES (B(P-1,N-1)) 

RC I RC = P - 2 

REGISTERS USED AS INPUT: R2 , ARO, ARl, RC 

REGISTERS MODIFIED: RO , Rl, R2 , R3 , RS , RE, RC, ARO, ARl 

REGISTER CONTAINING RESULT: R2 (f(0,n)) 

STACK USAGE: NONE 

PROGRAM SIZE: 12 WORDS 

EXECUTION CYCLES: 13 + 5 * (P-1) 

.global LATICE 



LATICE MPYF3 
* 

SUBF3 
* 
* 
* 

* 2 <= I <= P 
* 

RPTB 
MPYF3 
MPYF3 
* 

ADDF3 




LOOP 



STF 
SUBF3 



*ARO, *AR1, RO 
R0,R2,R2 



LOOP 
*AR0,R2,R1 



K(P) * B(P-1,N-1) -> RO 
ASSUME F(P,N) -> R2. 
F(P,N)-K(P)*B(P-1,N-1) 
=F(P-1,N) -> R2 



; K(I) * F(I-1,N) -> Rl 
*~AR0(1), *-ARl(l), RO 

; K(I-l) * B(I-1-1,N-1) -> R 
*AR1~{1) , Rl, R3 

; B{I-1,N-1) + K(I) * F(I-1,N) 

; = B(I,N) -> R3 
R3, *+ARl(2) ; B(I,N) -> B(I,N-1) 
R0,R2,R2 ; F ( I-1,N) -K(I-l) *B (I-l-l ,N-1) 
= F(I-1-1,N) -> R2 
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* 1=1 (CLEANUP) 



MPYF3 *ARO, R2, Rl 
ADDF3 *AR1, Rl, R3 



II 



STF 
STF 



R3, *+ARl(l) 
R2 , *AR1 



RETURN SEQUENCE 

RETS 
end 

. end 



K(l) * F(0,N) -> Rl 
B(0,N-1) + K{1) * F(0,N) 

= B(1,N) -> R3 
B(1,N) -> B(1,N-1) 
F(0,N) -> B(0,N-1) 



RETURN 
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12.5 Programming Tips 



Programming style is highly personal, and reflects each individual's prefer- 
ences and experiences. The purpose of this section is not to impose any par- 
ticular style. Instead, it intends to emphasize some of the features of the 
TMS320C30 that can help in producing faster and/or shorter programs. The 
following covers both C compiler and assembly language programming. 




12.5.1 C-Cailable Routines 

The TMS320C30 was designed with a high-level language (HLL) in mind. 
The large register file, the software stack and the large memory space makes 
implementation of a HLL compiler an easy task. The first such implementation 
supplied is a C compiler. Use of the C compiler increases the transportability 
of applications that have been tested on large, general-purpose computers, 
and decreases their porting time. 

For best usage of the compiler: 

1 ) Write the application in the high-level language. 

2) Debug the program. 

3) Estimate if it runs in real-time. 

4) If not, identify places where most of the execution time is spent. 

5) Optimize these areas by writing assembly language routines imple- 
menting the functions. 

6) Call the routines from the C program as C functions. 

When writing a C program, a simple way to increase the execution speed is 
to maximize the use of register variables. For more information, refer to the 
TMS320C30 C Compiler Reference Guide. 

There are certain conventions that have to be observed in writing a C-callable 
routine. These conventions are outlined in the Runtime Environment chapter 
of the "TMS320C30 C Compiler Reference Guide". Certain registers are saved 
by the calling function and others need to be saved by the called function. 
The C compiler manual will help achieve a very clean interface. The end result 
is the readability and natural flow of a high-level language combined with the 
efficiency and special -feature use of assembly language. 

12.5.2 Hints for Assembly Coding 

Each program will have its particular requirements. Not all possible optimiza- 
tions will make sense in every case. The suggestions presented in this section 
can be used as a checklist of available software tools. 

• Use delayed branches. Delayed branches take a single cycle to ex- 
ecute, regular branches take four. The following three instructions are 
also executed no matter if the branch is taken or not. If there are less 
than three instructions that could be used, use the delayed branch and 
append NOPs. Machine cycles (time) are still being saved. 

• Apply the repeat single/block construct. In this way, loops are 
achieved with no overhead. Nesting such constructs normally will not 
increase efficiency, so try to use the repeat feature on the most often 
performed loop. Note that RPTS is not interruptible, and the executed 
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instruction is not refetched for execution. This frees the buses for op- 
erands. 

O Use parallel instructions. It is possible to have a multiply in parallel 
with an add (or subtract), and stores in parallel with any multiply or 
ALU operation. This increases the number of operations executed in a 
single cycle. For maximum efficiency, observe the addressing modes 
used in parallel instructions and arrange the data appropriately. 

O Maximize the use of registers. The registers are a very efficient, 
easy way to access scratch -pad memory. Extensive use of the register 
file will also help when using parallel instructions and in avoiding the 
pipeline conflicts encountered using the registers in addressing modes. 

O Use the cache. Especially in conjunction with external slow memory. 
The cache is transparent to the user, so make sure that it is enabled. 

o Use internal memory instead of external memory. The internal 
memory (2K x 32 bits RAM and 4K x 32 bits ROM) is considerably 
faster to access. In a single cycle, two operands can be brought from 
internal memory. A way of maximizing performance is to use the DMA 
in parallel with the CPU to transfer data to internal memory before op- 
erating on them. 

O Avoid pipeline conflicts. If there is no problem with program speed, 
ignore this suggestion. For time-critical operations, make sure that cy- 
cles are not missed because of conflicts. The way to identify such 
conflicts is to run the trace function on the development tools (simula- 
tor, emulators) with the program tracing option enabled. The tracing 
will identify immediately the pipeline conflicts. Consulting the appro- 
priate section of this User's Guide will explain the reason for the con- 
flict. Steps can then be taken to correct the problem. 

The above checklist is not exhaustive, and it does not address the more de- 
tailed features outlined in the different sections of this manual. To exploit the 
full power of the TMS320C30 it is recommended that the architecture, hard- 
ware configuration, and instruction set of the device, described in earlier 
chapters, be carefully studied. 
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Section 13 

Hardware Applications 



The TMS320C30's advanced interface design allows this device to be used to 
implement a wide variety of system configurations. Its two external buses and 
DMA capability provide a parallel 32-bit interface to byte- or word-wide de- 
vices, while the interrupt interface, dual serial ports, and general purpose dig- 
ital I/O provide communication with a multitude of peripherals. 

This section describes how to use the TI\/IS320C30's interfaces to connect to 
various external devices. Specific discussions include implementation of par- 
allel interface to devices with and without wait states, use of DMA and general 
purpose I/O, and multiprocessing considerations. 

Major topics discussed in this section are as follows: 

O System Configuration Options Overview (Section 13.1 on page 13-2) 

O Primary Bus Interface (Section 13.2 on page 13-4) 

- Zero Wait State Interface to RAMs 

- Ready Generation 

- Bank Switching Techniques 

O Expansion Bus Interface (Section 13.3 on page 13-14) 

O System Control Functions (Section 13.4 on page 13-18) 

- Clock Oscillator Circuitry 

- Reset Sigjial Generator 

O User Target Design Considerations When Using the XDS1000 (Section 
13.5 on page 13-22) 
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13.1 System Configuration Options Overview 

The various TMS320C30 interfaces allow connections to a wide variety of 
different device types. Each of these interfaces is tailored to a particular family 
of devices. 

13.1 .1 Categories of Interfaces on the TMS320C30 

The interface types on the TMS320C30 fall into several different categories 
depending on the devices to which they were intended to be connected. Each 
interface comprises one or more signal lines which transfer information and 
control its operation. Shown in Figure 13-1 are the signal line groupings for 
each of these various interfaces. 



Primary 
Bus 



< 



System 
Control 



< 



Data ■*- 



32 



Address •^ — ^ 



24 



Control < ■^- 



External / 

DMA < 
Interface ^ ^ 

Interrupt / 
Interface ^ ^ 



\ > - 



External 
Flags 

System Reset 
ROM Enable 



_v2 



Master 
Clock 



<: 



Clock J '^~ 
Outputs T^.^_ 



DO-31 
AO-23 

R/W 

STRB 

RDY 

HOLD 



HOLDA 



INTO-3 
JACK 

XFO-1 



RESET 

MC/MP 

XI 

X2/CLKIN 

HI 

H3 



XDO-31 
XAO-12 

XR/W 



XRDY 



lOSTRB 



MSTRB 

TCLKO-1 

CLKXO-1 
DXO-1 
FSXO-1 

CLKRO-1 

DRO-1 

FSRO-1 



TMS320C30 



32 



13. 










t^V 




^2. 




2v 




\ — 


.,2v 




^2, 




-2s 



^ Data 



Address 



Control 



Timer Interface 



v^ Expansion 
< Bus 



Dual 

^ Serial 

Ports 




Figure 13-1 . External Interfaces on the TMS320C30 



All of the interfaces are independent of one another and different operations 
may be performed simultaneously on each interface. 

The Primary and Expansion buses implement the memory mapped interface to 
the device. The external DMA interface allows external devices to cause the 
processor to relinquish the Primary bus and allow direct memory access. 
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13.1.2 Typical System Block Diagram 

The devices which can be interfaced to the TI\/1S320C30 include memory, 
DMA devices, and numerous parallel and serial peripherals and I/O devices. 
Figure 13-2 illustrates a typical configuration of a TMS320C30 system 
showing different types of external devices and the interfaces to which they 
are connected. 
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Figure 13-2. Possible System Configurations 



This block diagram constitutes more or less a fully expanded system. In an 
actual design any subset of the illustrated configuration may be used. 
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13.2 Primary Bus Interface 

The primary bus is used by the Ti\/IS320C30 to access the majority of its me- 
mory mapped locations. Therefore, typically when a large amount of external 
memory is required in a system, it is interfaced to the primary bus. The ex- 
pansion bus (discussed in the next subsection) actually comprises two mu- 
tually exclusive interfaces, controlled by the MSTRB and lOSTRB signals 
respectively. Cycles on the expansion bus controlled by the MSTRB signal are 
identical in timing to cycles on the primary bus, with the exception that bank 
switching is not implemented on the expansion bus. Accordingly, the dis- 
cussion of primary bus cycles in this section applies equally to MSTRB cycles 
on the expansion bus. 

Although both the primary bus and the expansion bus may be used to inter- 
face to a wide variety of devices, the devices most commonly interfaced to 
these buses are memories. Therefore, detailed examples of memory interface 
will presented in this subsection. 

13.2.1 Zero Wait-State Interface To RAMs 

For full speed, zero wait-state interface to any devices, the TMS320C30 re- 
quires a read access time of 35 ns from address stable to data valid. Since, for 
most memories, access time from chip select is the same as access time from 
address, it is theoretically possible to use 35 ns memories at full speed with 
the TMS320C30. This, however, dictates that there be no delays present be- 
tween the processor and the memories. This is usually not the case in practice, 
due to interconnection delays and the fact that typically some gating is re- 
quired for chip select generation. Therefore, slightly faster memories are gen- 
erally required in most systems. If one level of reasonably high-speed (below 
10 ns in propagation delay) gating is used to generate chip select for the me- 
mories, 25 ns devices may be used. 

Among currently available RAMs, there are two distinct categories of devices 
with different interface characteristics. These two categories are RAMs with- 
out output enable control lines (OE), which include the 1 -bit wide organized 
RAMs and most of the 4-bit wide RAMs, and those with OE controls, which 
include the byte wide and a few of the 4-bit wide RAMs. Many of the fastest 
RAMs do not provide OE control, and use chip select (CS) controlled write 
cycles to insure that data outputs do not turn on for write operations. In CS 
controlled write cycles, the write control line (WE) goes low prior to CS going 
low, and internal logic holds the outputs disabled until the cycle is completed. 
Using CS controlled write cycles is an efficient way to interface fast RAMs 
without OE controls to the TMS320C30 at full speed. 

Figure 13-3 shows the TMS320C30 interfaced to Cypress Semiconductor's 
CY7C164 25 ns 16k x 4-bit CMOS static RAMs with zero wait states using 
CS controlled write cycles. These RAMs are arranged to implement 1 6k 32-bit 
words located at addresses OOOOOH thru 03FFFH, which are the first 16k 
words in external memory. Note that in Figure 1 3-3 the RDY input is tied low, 
selecting zero wait states for all accesses on the bus. 
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Figure 13-3. Ram Interface - No OE 



In this circuit, chip select is generated from STRB and A23 using a 74AS32, 
whose propagation delay is only 5.8 ns. Thus, the chip select delay added to 
the RAM's 25 ns chip select access time satisfies the TMS320C30's 35 ns read 
access time from address. This approach works well if only a single bank of 
external memory is implemented where the chip select decode can be accom- 
plished in only one level of gating. If more than one bank is required to im- 
plement very large memory spaces, bank switching can be used to provide for 
multiple bank select generation while still maintaining full speed accesses 
within each bank. Bank switching is discussed in detail in a later subsection. 
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Figure 13-5. Interface Write Timing 
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Figures 13-4 and 13-5 show the read an_d write timings of this interface, re- 
spectively. For read oper ations , WE (R/W) is inactive (high), and the device 
is selected whenever both STRB and A23 are low. The total time from address 
to data from the RAM is therefore: 

tacc = t1 + t2 = 5.8 + 25 = 30.8 ns 

This easily meets the TMS320C30's 35 ns access time requirement. For write 
oper ations , address and R/W change state far enough away in time from the 
low STRB_ pulse to allow this interface to easily meet specifications for most 
RAMs' CS controlled write cycles. In this case, the CY7C1 64s outputs disable 
at the beginning of the cycle well early enough (t1 =7 ns) to avoid bus con- 
tention with the TMS320C30. Data is then driven into the RAMs as STRB 
goes low. The RAMs require 1 3 ns of write data setup prior to CS going high, 
and this design provides around 65 ns (t2). A data hold time of ns (t3) is 
required by the RAMs, and this design provides greater than^lO ns. Finally, 
the RAMs setup and hold times for address with respect to CS of ns are also 
met with a clear margin. 

Some RAMs with OE controls can also use CS controlled write cycles and this 
interface may be used with some of these devices with OE tied low. There 
are, however, two requirements for the use of OE RAMs with this interface. 
First, the RAM's OE input must be gated with chip select and WE internally 
so that the device's outputs do not turn on unless a read is being performed. 
Secondly, the RAM must allow address inputs to change while WE is low, 
which some RAMs specifically prohibit. 

Many RAMs with OE controls that do not meet the design criteria for the cir- 
cuit shown in Figure 13-3 may be interfaced to the TMS320C30 using the 
approach shown in Figure 13-6 





Figure 13-6. RAM Interface - OE 
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This design shows an interface to Integrated Device Technology's IDT7198 
25 ns 1 6k X 4-bit CMOS static RAMs using Ol to enable and disable the data 
outputs. 

In this circuit, chip select is driven directly by a single address line, which lo- 
cates the RAM at addresses OOOOOH through 03FFFH in external memory. 
The RAM's WE input is generated by AN Ding R/W and STRB, and therefore 
WE goes low after CS only during write cycles. This satisfies the RAM's re- 
quirement that address never changes when WE is low. 

The timing of read operations, shown in Figure 13-7, is very straightforward 
since CS is driven directly. The read access time of the circuit, t1, is therefore 
simply the RAM's chip select/address access time, which is 25 ns. This pro- 
vides 10 ns of margin over the TMS320C30's 35 ns requirement. 
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During write operations, as shown in Figure 13-8, the RAM's outputs are 
disabled after a delay of t1 following R/W going low. This delay comprises 
the inverter propagation delay and the RAM's turn-off delay, therefore t1 is 
given by: 

t1 = 5 + 1 5 = 20 ns 

which results in the outputs being disabled no later than the falling edge of 
HI , thereby avoiding bus contention with the TMS320C30. The circuit's data 
setup and hold times of approximately 65 and 10 ns, respectively also easily 
meet the RAM's timing requirements. 

As with the circuit of Figure 13-3, if more complex chip select decode is re- 
quired than can be accomplished in time to meet zero wait state timing, wait 
states or bank switching techniques (discussed in a later subsection) should 
be used. 

It should be noted that the IDT7198's OE control is gated with CS internally, 
therefore the RAM's outputs are not enabled unless the device is selected. 
This is critical if there are any other devices connected to the same bus; if there 
are no other devices connected to the bus, then OE need not be gated inter- 
nally with chip select. 
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13.2.2 Ready Generation 




The use of wait states can greatly increase system flexibility and reduce hard- 
ware requirements over systems without wait state capability. The 
TMS320C30 has the capability of generating wait states on either the primary 
bus or the expansion bus; both buses have independent sets of ready control 
logic. Ready generation is discussed in this subsection from the perspective 
of the primary bus interface, however, wait state operation on the expansion 
bus is identical to that of the primary bus, therefore these discussions pertain 
equally well to expansion bus operation. Ready generation will not be in- 
cluded in the specific discussions of the expansion bus interface. 

Wait states are generated on the basis of the internal wait state generator, the 
external ready input (RDY), or the logical AND or OR of the two (see Section 
8.3). When enabled, internally generated wait states effect all external cycles, 
regardless of the address accessed. If different numbers of wait states are re- 
quired for various external devices, the external RDY input may be used to 
tailor wait state generation to specific system requirements. 

If the logical OR (or electrical AND since the signals are low true) of the ex- 
ternal and wait count ready signals is selected, the earlier of either of the two 
signals will generate a ready condition and allow the cycle to be completed. 
It is not required that both signals be present. 

The OR of the two ready signals can be used to implement wait states for 
devices which require a greater number of wait states than are implemented 
with external logic (up to eight). This feature is useful, for example, if a sys- 
tem contains some fast and some slow devices. In this case, fast devices can 
generate ready externally with a minimum of logic, and slow devices can use 
the internal wait counter for larger numbers of wait states. Thus, when fast 
devices are accessed, the external hardware responds promptly with ready 
which terminates the cycle. When slow devices are accessed, the external 
hardware does not respond, and the cycle is appropriately terminated after the 
internal wait count. 

The OR of the two ready signals may also be used if conditions occur which 
require termination of bus cycles prior to the number of wait states imple- 
mented with external logic. In this case, a shorter wait count is specified in- 
ternally than the number of wait states implemented with the external ready 
logic, and the bus cycle is terminated after the wait count. This feature may 
also be used as a safeguard against inadvertent accesses to nonexistent me- 
mory which would never respond with ready and therefore lock up the 
TMS320C30. 

If the OR of the two ready signals is used, however, and the internal wait state 
count is less than the number of wait states implemented externally, the ex- 
ternal ready generation logic must have the ability to reset its sequencing to 
allow a new cycle to begin immediately following the end of the internal wait 
count. This requires that, under these conditions, consecutive cycles must be 
from independently decoded areas of memory and that the external ready 
generation logic be capable of restarting its sequence as soon as a new cycle 
begins. Otherwise, the external ready generation logic may loose synchroni- 
zation with bus cycles and therefore generate improperly timed wait states. 
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If the logical AND (electrical OR) of the wait count and external ready signals 
is selected, the later of the two signals will control the internal ready signal, 
but both signals must occur. Accordingly, external ready control must be im- 
plemented for each wait state device in addition to the wait count ready signal 
being enabled. 

This feature is useful if there are devices in a system which are equipped to 
provide a ready signal but cannot respond quickly enough to meet the 
TMS320C30's timing requirements. In particular, if these devices normally 
indicate a ready condition and, when accessed, respond with a wait until they 
become ready, the logical AND of the two ready signals can be used to save 
hardware in the system. In this case, the internal wait counter can be used to 
provide wait states initially, and become ready after the external device has 
had time to send a not ready indication. The internal wait counter then re- 
mains ready until the external device also becomes ready, which terminates the 
cycle. 

Additionally, the AND of the two ready signals may be used for extending the 
number of wait states for devices which already have external ready logic im- 
plemented but require additional wait states under certain unique circum- 
stances. 

In the implementation of external ready generation hardware, the particular 
technique employed depends heavily on the specific characteristics of the 
system. The optimum approach to ready generation varies depending on the 
relative number of wait state and non-wait state devices in the system and the 
maximum number of wait states required for any one device. The approaches 
discussed here are intended to be general enough for most applications, and 
are easily modifiable to comprehend many different system conifigurations. 

In general, ready generation involves the following three functions: 

1 ) Segmentation of the address space in some fashion to distinguish fast 
and slow devices. 

2) Generate properly timed ready indications. 

3) Logically ORing all of the separate ready timing signals together to 
connect to the physical ready input. 

Segmentation of the address space is required so that a unique indication of 
each of the particular areas within the address space that require wait states 
can be obtained. This segmentation is commonly implemented in a system in 
the form of chip select generation. Chip select signals may be used to initiate 
wait states in many cases, however, occasionally chip select decoding con- 
siderations may provide signals which will not allow ready input timing re- 
quirements to be met. In this case, coarse address space segmentation may 
be made on the basis of a small number of address lines, where simpler gating 
allows signals to be generated more quickly. In either case, the signal indi- 
cating that a particular area of memory is being addressed is normally used to 
initiate a ready or wait state indication. 

Once the region of address space being accessed has been established, a 
timing circuit of some sort is normally used to provide a ready indication to the 
processor at the appropriate point in the cycle to satisfy each device's unique 
requirements. 
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Finally, since indications of ready status from multiple devices are typically 
present, an OR gate is commonly used to combine the signals to drive the RDY 
input. 

One of two basic approaches may be taken in the implementation of ready 
control logic depending upon the state in which the ready input is to be be- 
tween accesses. If RDY is low between accesses, the processor is always 
ready unless a wait state is required; if RDY is high between accesses, the 
processor will always enter a wait state unless a ready indication is generated. 

If RDY is low between accesses, control of full speed devices is straightfor- 
ward; no action is necessary since ready is always active unless otherwise re- 
quired. Devices requiring wait states, however, must drive ready high fast 
enough to meet the input timing requirements. Then, after an appropriate de- 
lay, a ready indication must be generated. This can be quite difficult in many 
circumstances since wait state devices are inherently slow and often require 
complex select decoding. 

If RDY is high between accesses, zero wait state devices, which tend to be 
inherently fast, can usually respond immediately with a ready indication. Wait 
state devices may simply delay their select signals appropriately to generate a 
ready. Typically, this approach results in the most efficient implementation of 
ready control logic. Figure 1 3-9 shows a circuit of this type which can be used 
to generate 0, 1 , or 2 wait states for multiple devices in a system. 
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74S114 
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— o— 
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Figure 13-9. Circuit For Generation of 0, 1, or 2 Wait States For 

Multiple Devices 



In this circuit, full speed devices drive ready directly through the 74AS21, and 
the two flip-flops delay wait state devices' select signals one or two HI cycles 
to provide 1 or 2 wait states. 

Considering the TMS320C30's ready delay time of 8 ns following address, 
zero wait state devices must use ungated address lines directly to drive the 
input of the 74AS21 , since this gate contributes a maximum propagation delay 
of 6 ns to the RDY signal. Thus, zero wait state devices should be grouped 
together within a coarse segmentation of address space if other devices in the 
system require wait states. 

With this circuit, devices requiring wait states may take up to 42 ns from a 
valid address on the TMS320C30 to provide inputs to the 74AS20's inputs. 
Typically, this allows sufficient time for any decoding required in generating 
select signals for slower devices in the system. For example, the 74ALS138 
driven by address and STRB, can generate select decodes in 22 ns, which ea- 
sily meets the TMS320C30's timing requirements. 
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With this circuit, unused inputs to either the 74AS20s or the 74AS21 should 
be tied to a logic high level to prevent noise from generating spurious wait 
states. 

If more than 2 wait states are required by devices within a system, other ap- 
proaches may be employed for ready generation. If between three and eight 
wait states are required, additional flip-flops may be included, in the same 
manner as shown in Figure 13-9, or internally generated wait states may be 
used in conjunction with external hardware. If greater than eight wait states 
are required, an external circuit using a counter may be used to supplement the 
internal wait state generators capabilities. 

13.2.3 Bank S\A^itching Techniques 

The TMS320C30's programmable bank switching feature can greatly ease 
system design when large amounts of memory are required. This feature is 
used to provide a period of time during which all device selects are disabled 
that would not normally be present otherwise (refer to Section 8.4 for further 
information regarding bank switching). During this interval, slow devices are 
allowed time to turn off before other devices have the opportunity to drive the 
data bus, thus avoiding bus contention. 

When bank switching is enabled, any time a portion of the high order address 
lines change, as defined by the contents of the BNKCMPR register, STRB goes 
high for one full HI cycle. Provided STRB is included in chip select decodes, 
this causes all devices to be disabled during this period. The next bank of 
devices is not enabled until STRB goes low again. 

Bank switching is not required during writes since these cycles always exhibit 
an inherent one-half HI cycle setup of address information before STRB goes 
low. Thus, when using bank switching for read/write devices, a minimum of 
half of one HI cycle of address setup is provided for all accesses. Therefore, 
large amounts of memory can be implemented without wait states or extra 
hardware required for isolation between banks. Also, note that access time for 
cycles during bank switching is the same as that of cycles without bank 
switching, and accordingly, full speed accesses may still be accomplished 
within each bank. 

The circuit shown in Figure 13-10 illustrates the use of bank switching with 
Cypress Semiconductor's CY7C185 25 ns 8kx8 CMOS static RAM. This cir- 
cuit implements 32k 32-bit words of memory with full speed zero wait state 
accesses within each bank. 
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Figure 13-10. Bank Switching For Cyprus Semiconductors CY7C185 



Each of the four banks in this circuit is selected using a decode of A15-A13 
generated by the 74AS138. With the BNKCiVIPR register set to >OBh, the 
banks will be selected on even 8k word boundaries starting at location zero 
in external memory space. 

This circuit could not have been implemented without bank switching, since 
data output's turn-on and turn-off delays would have caused bus conflicts, 
and full speed accesses do not allow enough time for chip select decoding for 
the four banks. Here, the propagation delay of the 74AS138 is only involved 
during bank switches, where there is sufficient time between cycles to allow 
new chip selects to be decoded. 

The timing of this circuit for read operations using bank switching is shown 
in Figure 13-11 With the BNKCMPR register set to >OBh, when a bank 
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switch occurs, the bank a ddres s on address lines A23-A13, is updated during 
the extra HI cycle while STRB is high. Then, after chip select decodes have 
stabilized, and the previously selected bank has disabled its outputs, STRB 
goes low for the next read cycle. Further accesses occur at full speed with the 
normal bus timings, as long as another bank switch is not necessary. Write 
cycles do not require bank switching due to the inherent address setup pro- 
vided in their timings. 
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Figure 13-11. Timing For Read Operations Using Bank Switching 




This timing is summarized in Table 1 3-1 . 

Table 13-1. Bank Switching Interface Timing 



Time 
Interval 


Event 


Time 
Period 


ti 


HI falling to address/STRB valid 


10 ns 


t2 


STRB to select delay 


4.5 ns 


t3 


Memory disable from select 


15 ns 


t4 


HI falling to STRB 


10 ns 


t5 


STRB to select delay 


4.5 ns 


t6 


Memory output enable delay 


3 ns 
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13.3 Expansion Bus interface 

The TMS320C30S expansion bus interface provides a second complete paral- 
lel bus which can be used to implement data transfers concurrently with, and 
independent of, operations on the primary bus. The expansion bus comprises 
two mutually exclusive interfaces controlled by the MSTRB and iOSTRB sig- 
nals, respectively. These two signals are activated depending on what section 
of the memory space is accessed. This subsection discusses interface to the 
expansion bus using IOSTRB ; MSTRB cycles are identical in timing to primary 
bus cycles, and are discussed in Section 13.2. 

Unlike thre primary bus, both read and write cycles on the I/O portion of the 
expansion bus are two HI cycles in duration and exhibit the same timing. Thr 
XR/W signal is high for reads and low for writes. Since I/O accesses talce two 
cycles, many peripherals that require wait states if interfaced either to the pri- 
mary bus or using MSTRB may be used in a system without the need for wait 
states. Specifically, any devices with address access times greater than the 
35 ns required by the primary bus but not less than 46 ns can be interfaced 
to the I/O bus without wait states. 

A/D converters are one common DSP system component which often falls 
into this category. These devices are available in many speed ranges and with 
a variety of features, and while some may require one or more wait states on 
the I/O bus, others may be used at full speed. 

One A/D converter that interfaces to the I/O bus without wait states and re- 
quires minimal additional logic is the ad 1332 from Analog Devices. Figure 
13-12 illustrates an interface to this device. 
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XR/W 
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YO > 



AD1332 




Figure 13-12. Expansion Bus Interface to A/D Converter 




The interface uses a 74ALS138 to decode chip select for the converter. This 
configuration is shown assuming that other peripheral devices in the system 
also require chip select decodes. XA(8-10) are decoded to locate the con- 
verter at address 0804000h, which is the beginning of the I/O address space. 
Other peripherals may also use the outputs of the decoder, which generates 
chip selects in the I/O address space on 256 word boundaries. 

XAO is used to drive the single address line required in interfacing to the con- 
verter. This input selects between an internal 32-word FIFO buffer and the 
A/D's control/status register. Thus, the FIFO is located at address 0804000h 
and the control/status register is located at address 0804001 h. 
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Since the converter requires RD and WR control signals rather th an WE and 
OE, random logic is u sed to generate these signals from lOSTRB and XR/W. 
The converter's IRQ (Interrupt Request) output is used to alert the 
TMS320C30 to various conditions of converter status. 

Figure 13-13 shows the timing for read and write operations between the 
TMS320C30 and the AD1332. Both operations are shown on the same tim- 
ing diagram since, unlike the primary bus, only data bus timing and the state 
of XR/W differ between the two different types of cycles. 
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Figure 13-13. Timing of Expansion Bus Interface 



In both cases, address and R/W are valid t1 = 10 ns after the falling edge of 
HI . After t2 = 1 7 ns, the propagation delay of the 74ALS1 38, the A/D con- 
verter's chip select goes lo w, selecting the device. Then, t3 = 10 ns after the 
rising edge of HI, lOSTRB goes low, and t4 = 5.8 ns following this, the RD or 
WR signal to the converter goes low, initiating either a read or write cycle, re- 
spectively. 

For a read operation, the A/D converter provides data back to the TMS320C30 
t4 + t5 = 30.8 ns after RD goes low. T his satis fies the TMS320C30's re- 
quirement of having data valid 35 ns after lOSTRB. For write operations, the 
A/D converter requires less than 5 ns of data setup and hold time with respect 
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to the rising edge of WR. Tliis is met witli a high degree of margin by the 
TMS320C30. 

It should be noted that for the AD1 332's FIFO to be clocked properly, the RD 
signal must go high between accesses to the device. Therefore, although the 
AD1332 may be fast enough i n som e cases to be used at speeds approaching 
those of the primary bus, the STRB signal on the primary bus stays low for 
multiple consecutive read cycles. The I/O bus, therefore, is the preferable 
choice for interface to this device. 
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13.4 System Control Functions 

There are several aspects of TMS320C30 system hardware design which are 
critical to overall system operation. These include such functions as clock and 
reset signal generation and interrupt control. 

13.4.1 Clock Oscillator Circuitry 

An input clock may be provided to the TMS320C30 either from an external 
clock input or by using the on-board oscillator. Unless special clock require- 
ments exist, using the on-board oscillator is generally a convenient method 
of clock generation. This method requires few external components and can 
provide stable, reliable clock generation for the device. 

Figure 1 3-1 4 shows a clock generator circuit using the internal oscillator. This 
circuit is designed to operate at 33.33 MHz and since crystals with funda- 
mental oscillation frequencies of 30 MHz and above are not readily available, 
a parallel-resonant third-overtone circuit is used. 



TMS320C30 
XI X2/CLKIN 



O 



47 pF 



33.33 MHz 

; 0.1 /tF 



2.6 /tH 



20 pF 



V V V 

Figure 13-14. Crystal Oscillator Circuit 



In a third-overtone oscillator, the crystal fundamental frequency must be at- 
tenuated so that oscillation is at the third harmonic. This is achieved with an 
LC circuit that filters out the fundamental, thus allowing oscillation at the third 
harmonic. The impedance of the LC network must be inductive at the crystal 
fundamental and capacitive at the third harmonic. The impedance of the LC 
circuit is given by: 

L 
C 




z(ui) 



uiL - ojC 



(3) 
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Therefore, the LC circuit has a pole at: 



= V" 



_1_ 

LC 



(4) 



At frequencies significantly lower than Up, the 1/(ajC) term in (3) becomes 
the dominating term, while wL can be neglected. This gives: 



z(aj) =ju)L 



for u << ojr 



(5) 



In (5), the LC circuit appears inductive at frequencies lower than up. On the 
other hand, at frequencies much higher than Wp, the ujL term is the dominant 
term in (3), and 1/(ujC) can be neglected. This gives: 



_1_ 
z(u)) = jwC 



for lU >>U)r 



(6) 



The LC circuit in (6) appears increasingly capacitive as frequency increases 
above Up. This is shown in Figure 13-15, which is a plot of the magnitude 
of the impedance of the LC circuit of Figure 13-14 versus frequency. 



\Z (o)) 



CAPACITIVE 
REGION 




(rad/s) 




Figure 13-15. Magnitude of the Impedance of the Oscillator LC 

Network 
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Based on the discussion above, the design of the LC circuit proceeds as fol- 
lows: 

1 ) Choose the pole frequency ujp approximately halfway between the crys- 
tal fundamental and the third harmonic. 

2) The circuit now appears inductive at the fundamental frequency and 
capacitive at the third harmonic. 

In the oscillator of Figure 13-14, choose Up = 22.2 MHz, which is approxi- 
mately halfway between the fundamental and the third harmonic. Choose C 
= 20 pF. Then, using (4), L = 2.6 pH. 



13.4.2 Reset Signal Generation 



The reset input controls initialization of internal TMS320C30 logic and also 
causes execution of the system initialization software. For proper system in- 
itialization, the reset signal must be applied at least ten HI cycles, i.e., 600 ns 
for a TMS320C30 operating at 33.33 MHz. Upon powerup, however, it can 
take 20 ms or more before the system oscillator reaches a stable operating 
state. Therefore, the powerup reset circuit should generate a low pulse on the 
reset line for 100 to 200 ms. Once a proper reset pulse has been applied, the 
processor fetches the reset vector from location zero which contains the ad- 
dress of the system initialization routine. Figure 13-16 shows a circuit which 
will generate an appropriate powerup reset signal. 

TMS320C25 



Rl =1 Mfi 



Ci =0.47 ,iF 




DGND 

Figure 13-16. Reset Circuit 
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The voltage on the reset pin (RESET) is controlled by the R1C1 network. After 
a reset, this voltage rises exponentially according to the time constant R1C1, 
as shown in Figure 13-17. 



VOLTAGE ' i 




vcc 



to -0 ti 



v - Vcc (1-e-t/n 



TIME 




Figure 13-17. Voltage on the TMS320C30 Reset Pin 
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The duration of the low pulse on the reset pin is approximately ti, which is the 
time it takes for the capacitor C^to be charged to 1 .5 V. This is approximately 
the voltage at which the reset input switches from a logic to a logic 1 . The 
capacitor voltage is given by: 



V = Vcc 



r -1 

J - e - T J 



(7) 



where t = RiCi is the reset circuit time constant. Solving (7) for t gives: 

r -^1 

t = -RiCi In [_1 -VccJ (8) 

Setting the following: 



Rl = 1 MO 
Ci = 0.47 mF 
Vcc = 5 V 
V = Vi = 1.5 V 

gives t = 167 ms. Therefore, the reset circuit of Figure 13-16 provides a low 
pulse of long enough duration to ensure the stabilization of the system oscil- 
lator upon powerup. 

Note that if synchronization of multiple TMS320C30's is required, all proces- 
sors should be provided with the same input clock and the same reset signal. 
After powerup, when the clock has stabilized, all processors may then be 
synchonized by generating a falling edge on the common reset signal. Since 
it is in fact the falling edge of reset that establishes synchronization, reset must 
be high for a period of time (at least ten HI cycles) initially. Following the 
falling edge, reset should remain low for at least ten HI cycles and then be 
driven high. This sequencing of reset may be accomplished using additional 
circuitry, based on either RC time delays or counters. 
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13.5 XDS1000 Target Design Considerations 

The TI\/IS320C30 Emulator is an Extended Developnnent System (XDS1000), 
which has all the features necessary for full-speed emulation. The 
TMS320C30 uses a revolutionary technology to allow complete emulation via 
a serial scan path. If the user provides a 1 2 pin header on their target system, 
realtime emulation can be performed using the TMS320C30 device in their 
target system. Refer to Appendix B, Section B.I .4 for a complete description 
of theXDSIOOO. 

To use the emulation connector of the XDS1000, the signals shown in Figure 
13-18 should be provided to a 12 pin header (two rows of six pins) with pin 
8 cut out to provide keying. 



EMU It 


1 


2 


GND 


EMUO^ 


3 


4 


GND 


EMU2t 


5 


6 


GND 


PD ( + 5 V) 


7 




NO PIN (KEY) 


EMUS 


9 


10 


GND 


H3 


11 


12 


GND 



HEADER DIMENSIONS: 

PIN TO PIN SPACING 0.100 IN. (X,Y) 
PIN WIDTH 0.025 IN. SQUARE POST 
PIN LENGTH 0.235 IN NOMINAL 



TOP VIEW 

"•"These signals should always be pulled up with separate 20 kQ resistors to +5 volts on the TMS320C30. 

Figure 13-18. 12 Pin Header Signals 




Signal Description: 

EMUO Emulation pin 0. 

EMU1 Emulation pin 1 . 

EMU2 Emulation pin 2. 

EMU3 Emulation pin 3. 

H3 TMS320C30 H3 

PD Presence detect. 



It indicates that the cable is connected 
and target system is powered up. It 
should be tied to +5 volts in the target 
system. 
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Figure 13-19 is a diagram of the typical setup when using the emulation 
connection of the XDS1 000. 



12 PIN 
HEADER 



36" MINI 
COAX 




TMS320C30 



XDS1000 

CONTROLLER 

CARD 



4" 
MINI 
COAX 
USER 
TARGET SYSTEM 



Figure 13-19. Typical Setup For Using the Emulation Connection of the 

XDS1000 



For unbuffered signals, the distance between the TMS320C30 emulation pins 
(EMUO, EMU1, EMU2, EMU3, and H3) and the 12 pin header should be less 
than two inches. If the distance between the header and the TMS320C30 
emulation pins is more than two inches but less than six inches, the EMUS 
and H3 signals should be buffered. The buffer should be noninverting with a 
worst case propagation delay of 6.0 ns. For TMS320C30 emulation pins to 
12 pin header distances greater than six inches, all emulation signals should 
be buffered. Recall that EMUO, EMU1, and EMU2 are inputs and EMUS and 
HS are outputs. The buffer should have the same characteristics as given 
above. 
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Appendix A 

TnilS320C30 Timing Specifications & Dimensions 



This section provides timing specifications and dimensions for the 
TMS320C30 (third-generation TMS320) processor. In order to provide in- 
formation in advance of the complete data sheet, this section is included. 
Characterization data on the TMS320C30 is still being collected. A complete 
data sheet with additional information will be available in the future. Please 
contact the local Tl field sales office to obtain these data sheets. 
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Table A-1. Absolute Maximum Ratings Over Specified Temperature Range 



Condition/Characteristic 


Range 


Supply voltage range, Vqq 


-0.3 V to 7 V 


Input voltage range 


-0.3 V to 7 V 


Output voltage range 


-0.3 V to 7 V 


Continuous power dissipation 


2.0 W 


Operating free-air temperature range 


0°C to 70°C 


Storage temperature range 


-55°Cto150°C 



Notes: 

1 ) Stresses beyond those listed under 'Absolute Maximum Ratings' may 
cause permanent damage to the device. This is a stress rating only and 
functional operation of the device at these or any other conditions be- 
yond those indicated in the 'Recommended Operating Conditions' sec- 
tion of this specification is not implied. Exposure to 
absolute-maximum-rated conditions for extended periods may affect 
device reliability. 

2) All voltage values are with respect to Vss- 

Table A-2. Recommended Operating Conditions 



Operating Condition 


MIn 


Nom Max 


Unit 


Vdo Supply voltages (DVDD, etc.) 


4.75 


5 5.25 


V 


Vss Supply voltages (CVSS, etc.) 





V 


V|H High-level input voltage 


2 


Vdd + 0.3 


V 


V|L Low-level input voltage 


-0.3 


0.8 


V 


Iqh High-level output current 


300 


ma 


Iql Low-level output current 


2 


mA 


T Operating free-air temperature 





70 


°C 



ri^i] 
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Table A-3. Electrical Characteristics Over Specified Free-Air Temperature 

Range 



Electrical Characteristic 


Min 


Norn 


Max 


Unit 


Vqh High-level output voltage (Vdq = Min,l qh ~ Max) 


2.4 


3 




V 


Vol Low-level output voltage (Vdq = Min.l ql ^ Max) 




0.3 


0.6 


V 


\z Three-state current (Vpo = Max) 


-20 




20 


ma. 


l| Input current (V| = Vgs toV qq) 


-10 




10 


ma 


Ice Supply current (T^ = 25°C,Vod= Max, f ^ = Max) 


300 


mA 


C| Input capacitance 


15 


PF 


Cq Output capacitance 


15 


pF 



Notes: 

1 ) All typical values are at Vdd = 5 V, Ta = 25°C. 

2) fx is the input clock frequency. The maximum value is 33.3 MHz. 

3) All input and output voltage levels are TTL compatible. 



2.15 V 



Rl = 825 n 



FROM OUTPUT. 
UNDER TEST 



TEST POINT 



Cl = 100 pF 



Figure A-1. Test Load Circuit 
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Figure A-2. X2/CLKIN Timing 
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Figure A-3. H1/H3 Timing 
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Table A-4. Switching Characteristics for CLKIN, H1, and H3 



No. 


Name 


Description 


Min Typ Max 


Unit 


(1) 


tf(CI) 


CLKIN fall time 


5 


ns 


(2) 


tw(CIL) 


CLKIN low pulse duration 
tc(CI) = 30 ns 


10 


ns 


(3) 


tw(CIH) 


CLKIN high pulse duration 
tc(CI) = 30 ns 


10 


ns 


(4) 


tr(CI) 


CLKIN rise time 


5 


ns 


(5) 


tc(CI) 


CLKIN cycle time 


30 


ns 


(6) 


tf(H) 


H1/H3 fall time 


1 3 


ns 


(7) 


tw(HL) 


H1/H3 low pulse duration 


P- 6 


ns 


(8) 


tw(HH) 


H1/H3 high pulse duration 


P-7 


ns 


(9) 


tr(H) 


H1/H3 rise time 


2 4 


ns 


(9.1) 


td(HL-HH) 


Delay from HI (H3) low to 
H3(H1) high 


5 


ns 


(9.2) 


td(HH - HL) 


Delay from H1{H3) high to 
H3(H1) low 


5 


ns 


(10) 


tc(H) 


H1/H3 cycle time 


60 


ns 



Note: P = tc(CI) 



Table A-5. Switching Characteristics for a memory ((M)STRB = 0) read 



No. 


Name 


Description 


Min Typ Max 


Unit 


(11) 


td(H1L- (M)SL) 


HI low to (M)STRB low 


10 


ns 


(12) 


td(H1L- (M)SH) 


HI low to (M)STRB high 


10 


ns 


(13) 


td(H1H - (lO)RWL) 


HI high to (IO)R/Wlow 


10 


ns 


(14) 


td(H1L-(IO)A) 


HI low to (IO)A valid 


10 


ns 


(15) 


tsu((IO)D)R 


(IO)D valid before HI low 
(read) 


15 


ns 


(16) 


th((IO)D)R 


(10) D hold time after HI 





ns 


(17) 


tsu((IO)RDY) 


(10) RDY valid before HI 
high 


8 


ns 


(18) 


th((IO(RDY) 


(10) RDY hold time after 
HI high 





ns 
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Figure A-4. Memory ((l\/I)STRB = 0) Read 



Table A-6. Switching Characteristics for a memory ((M)STRB = 0) Write 



No. 


Name 


Description 


Min Typ Max 


Unit 


(19) 


td(H1H - (lO)RWH) 


HI high to (IO)R/Whigh 


10 


ns 


(20) 


tv((IO)D)W 


(IO)D valid after HI low 
(write) 


20 


ns 


(21) 


th((IO)D)W 


(10) D hold time after HI 
high (write) 





ns 
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Figure A-6. Memory ((M)STRB = 0) Read 
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Table A-7. Switching Characteristics for a Memory (iOSTRB = 0) Read 



No. 


Name 


Description 


Min Typ Max 


Unit 


(11.1) 


td(H1H - lOSL) 


HI high to IOSTRB low 


10 


ns 


(12.1) 


td(H1H - lOSH) 


HI high to IOSTRB high 


10 


ns 


(22) 


td(H1L- lORWH) 


HI low to lOR/Whigh 


10 


ns 


(14.1) 


td(H1L- lOA) 


HI low to lOA valid 


10 


ns 


(15.1) 


tsu(IOD)R 


lOD valid before HI high 
(read) 


15 


ns 


(16.1) 


th(IOD)R 


lOD hold time after HI 
high (read) 





ns 


(17.1) 


tsu(IORDY) 


lORDY valid before HI high 


8 


ns 


(18.1) 


th(IORDY) 


lORDY hold time after HI 
high 





ns 



Table A-8. Switching Characteristics for a Memory (IOSTRB = 0) Write 



No. 


Name 


Description 


Min 


Typ Max 


Unit 


(23) 


td(H1L- lORWL) 


HI low to (IO)R/Wlow 





10 


ns 


(20.1) 


tv(IOD)W 


lOD valid before HI low 
(write) 


15 


ns 


(21.1) 


tv(IOD)W 


lOD hold time after HI low 
(write) 





ns 
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Figure A-7. Memory (lOSTRB = 0) Write 
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Figure A-8. Timing for XFO and XF1 When Executing a LDFI or LDII 
Table A-9. Information for Figure A-8 



No. 


Name 


Description 


Min 


Typ Max 


Unit 


(1) 


td(H3H-XF0L) 


H3 high to XFO low 


10 


ns 


(2) 


tsu(XF1) 


XF1 valid before HI low 


8 


ns 


(3) 


th(XFI) 


XF1 hold time after HI low 





ns 




ri^i 
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Figure A-9. Timing for XFO When Executing a STFI or STII 



Table A-10. Information for Figure A-9 



No. 


Name 


Description 


Min Typ Max 


Unit 


(1) 


td{H3H-XF0H) 


H3 high to XFO high 


10 


ns 
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Figure A-10. Timing for XFO and XF1 When Executing SIGI 



Table A-11. Information for Figure A-10 



No. 


Name 


Description 


Min Typ Max 


Unit 


(1) 


td{H3H-XF0L) 


H3 high to XFO low 


10 


ns 


(2) 


td(H3H-XF0H) 


H3 high to XFO high 


10 


ns 


(3) 


tsu(XFI) 


XF1 valid before HI low 


8 


ns 


(4) 


th(XFI) 


XFl hold time after HI low 





ns 




riBi 
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Figure A-11. Timing for Loading XF Register When Configured as an Output 

Pin 



Table A-12. Information for Figure A-11 



No. 


Name 


Description 


Min Typ Max 


Unit 


(1) 


tv(H3H-XF) 


H3 high to XF valid 


10 


ns 




iBil 
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Figure A-12. Change of XF From Output to Input Mode 



Table A-13. Information for Figure A-12 



No. 


Name 


Description 


Min Typ Max 


Unit 


(1) 


td(H3H-XF0l) 


H3 high to XF switching 
from output to input 


15 


ns 


(2) 


tsu(XF) 


XF setup before HI low 


10 


ns 


(3) 


th{XF) 


XF hold before HI low 





ns 




h^9 
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Figure A-13. Change of XF From Input to Output Mode 



Table A-14. Information for Figure A-13 



No. 


Name 


Description 


MIn Typ Max 


Unit 


(1) 


td(H3H-XFI0) 


H3 high to XF switching 
from input to output 


10 


ns 
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Figure A-14. RESET Timing 
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Table A-15. Information for Figure A-14 



No. 


Name 


Description 


MIn Typ Max 


Unit 


(1) 


tsu( RESET) 


Setup for RESET 
before CLKIN low 


10 


ns 


(2) 


td(CLKINH-HIH) 


CLKIN high to HI high 


15 


ns 


(3) 


tsu(RESETH-HIL) 


Setup for RESET high 
before HI low and after 10 
HI clock cycles 


10 


ns 


(5) 


td(CLKINH-H3L) 


CLKIN high to H3 low 


15 


ns 


(8) 


tdis(HIH-XD) 


HI high to (X)D three state 


15 


ns 


(9) 


tdis(H3H-XA) 


H3 high to (X)A three state 


10 


ns 


(10) 


td(H3H- 
CONTROLH) 


H3 high to control signals 
high 


10 


ns 


(11) 


td(HIH-IACKH) 


HI high to JACK high 


10 


ns 


(12) 


tdis(RESETL- 
ASYNCH) 


RESET low to 

asynchronously reset signals 
three state 


15 


ns 




Ml] 
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Figure A-15. RESET and INT(3-0) Response Timing 



Table A-16. Information for Figure A-15 



No. 


Name 


Description 


Min 


Typ Max 


Unit 


(1) 


tsu(INT) 


INT(3-0) setup before HI low 


10 


ns 


(2) 


tw(INT)/Vofe / 


Interrupt pulse width to 
guarantee one interrupt seen 


P 


1.5P <2P 


ns 



Note 1: Interrupt pulse width must be at least 1 P wide to guarantee it will be seen. It must be less than 
2 P wide to guarantee it will be responded to only once. The recommended pulse width is 1.5 P. 
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Figure A-16. lACK Timing 



Table A-17. information for Figure A-16 



No. 


Name 


Description 


MIn Typ Max 


Unit 


(1) 


tdCHIH-IACKL) 


HI high to lACK low 


10 


ns 


(2) 


td(HIH-IACKH) 


HI high to lACK high during 
first cycle of JACK instruction 
data read 


10 


ns 
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Figure A-17. TRAP Response Timing 
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NOTE: Timing diagrams show operation with CLKXP = CLKRP = FSXP = FSRP = 0. 



Figure A-18. Fixed Data Rate Mode 
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NOTES: 1. Timing diagrams show operation with CLKXP = CLKRP = FSXP = FSRP = 0. 

2. Timings not expressly specified for variable data rate mode are the same as those for fixed data rate mode. 
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Figure A-19. Variable Data Rate Mode 
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Table A-18. Serial Port Timing as Shown in Figures A-18 and A-19 



No. 


Name 


Description 


Max 


Min 


Unit 


(1) 


td(HI-SCK) 


HI high to internal CLKX/R. 


15 


ns 


(2) 


tc(SCK) 


CLKX/R cycle 
time. 


CLKX/R external 


tc(H)x2.6 


ns 


CLKX/R internal 


tc(H)x234 


tc(H)x2 


(3) 


tw(SCK) 


CLKX/R high/ 
low pulsewidth 


CLKX/R external 


tc(H)+5 


ns 


CLKX/R internal 


tc(SCK)/2 


[tc(SCK)/2]-15 


(4) 


tr(SCK) 


CLKX/R rise time. 


8 


ns 


(5) 


tf(SCK) 


CLKX/R fall time 


8 


ns 


(6) 


td(DX) 


CLKX to DX 
valid. 


CLKX external 


35 


ns 


CLKX internal 


20 


(7) 


tsu(DR) 


DR setup before 
CLKR. 


CLKR external 


10 


ns 


CLKR internal 


25 


(8) 


thCDR) 


DR hold from 
CLKR. 


CLKR external 


10 


ns 


CLKR internal 


-5 


(9) 


td(FSX) 


CLKX to internal 
FSX. 


CLKX external 


32 


ns 


CLKX internal 


17 


(10) 


tsu(FSR) 


FSR setup 
before CLKR. 


CLKR external 


10 


ns 


CLKR internal 


10 


(11) 


th(FS) 


FSX/R input 

hold from 
CLKX/R. 


CLKX/R external 


10 


ns 


CLKX/R internal 


-5 


(12) 


tsu(FSX) 


External FSX 

setup before 
CLKX. 


CLKX external 


[t c(CLKX)/2] 


-10 -[tc(H)-8] 


ns 


CLKX internal 


tc(CLKX)/2 


-[tc(H)-21] 


(13) 


td(CH-DX)V 


CLKX to first 

DX bit, FSX 
precedes CLKX. 


CLKX external 


36 


ns 


CLKX internal 


21 


(14) 


td(FSX-DX)V 


FSX to first DX bi 
FSX. 


t, CLKX precedes 


36 


ns 




m*] 
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Figure A-20. HOLD/HOLDA Timing 



Table A-19. Information for Figure A-20 



No. 


Name 


Description 


Min 


Max 


Unit 


(1) 


tsu(HOLD) 


HOLD valid before HI low 


15 


ns 


(3) 


tv(HOLD) 


HOLD valid after HI low 





10 


ns 


(4) 


UHOLD) 


HOLD low width 


2 


HI cycles 


(6) 


tw(HOLDA) 


HOLDA low width 


1 


HI cycle 


(V) 


td(H1L-SH)H 


HI low to STRB high for a 
HOLD 





10 


n s 


(8) 


tdis(HIL-S) 


HI low to STRB three state 





10 


ns 


(9) 


ten(HIL-S) 


HI low to STRB active 





10 


ns 


(10) 


tdis(HIL-RW) 


HI low to R/W three state 





10 


ns 


(11) 


ten(HIL-RW) 


HI low to R/W active 





10 


ns 


(12) 


tdis(HIL-A) 


HI low to address three state 





10 


ns 


(13) 


ten(HIL-A) 


HI low to address valid 





10 


ns 


(16) 


tdis(HIH-D) 


HI high to data three state 





10 


ns 
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Figure A-21 . TMS320C30 180 Pin PGA Dimensions 
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Appendix B 

Developmevit Support/Part Order Information 



This section provides development support information, device part numbers, 
and support tool ordering information for the TMS320C30 (third-generation 
TMS320) processor. Figure B-1 shows the software and hardware develop- 
ment tools available and the development environment for the TMS320C30. 
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Figure B-1. TMS320C30 Development Environment 
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Extensive documentation, including data sheets, user's guides, and application 
reports, is available to support DSP design. A series of DSP textbooks has 
been published both by Prentice-Hall and John Wiley and Sons to support 
research and education. Other support includes a technical support hotline 
(713-274-2320) and a bulletin board service (713-274-2323). Tl's Regional 
Technology Centers (RTCs) provide hands-on workshops and design ser- 
vices. 

Many third-parties and consultants with DSP expertise can assist in various 
application areas. TMS320C30 Algorithm Development Packages will be 
available from multiple third-parties and consultants in the near future. Sub- 
scribe to the DSP newsletter "Details on Signal Processing" for up to date 
information on new products and services from third-parties and consultants. 
Call Tl's Customer Response Center at (800) 232-3200 to subscribe to the 
newsletter. Contact the nearest Tl field sales office for support tool availability 
or further details (see list of sales offices and distributors at end of book). 

The major topics discussed in this section are listed below. 

• TMS320C30 Development Support (Section B.I on page B-4) 

- Macro Assembler/Linker 

- C Compiler 

- Simulator 

Extended Development System (XDS1000) 
TMS320 DSP Hotline/Bulletin Board Service 

• TMS320C30 Part Order Information (Section B.2 on page B-12) 

- Device part numbers 

- Software and hardware support tools part numbers 

- Device and support tool prefix designators 

- Device nomenclature 
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B.1 TMS320C30 Development Support 

Texas Instruments offers extensive development support and complete doc- 
umentation with the TMS320C30 (third-generation) digital signal processor. 
Tools are provided for the TMS320C30 to evaluate the performance of the 
processor, develop algorithm implementations, and fully integrate the design's 
software and hardware modules. Development operations are performed with 
the TMS320C30 Macro Assembler/Linker, C Compiler, Simulator, and Emu- 
lator (Extended Development System - XDS1000). 

A description and key features for each TMS320C30 development support 
tool is provided in the following subsections. For ordering information, see 
Section B.2. 

B.1.1 Macro Assembler/Linker 

The TMS320C30 Macro Assembler/Linker is a software tool that converts 
source mnemonics to executable object code. 

The following key features distinguish the TMS320C30 Macro 
Assembler/Linker: 

Macro capabilities and library functions 
Conditional assembly 
Relocatable modules 
Complete error diagnostics 
Symbol table and cross reference 

he TMS320C30 Macro Assembler/Linker is shipped with four programs to 
address specific needs. They are: 




1) 


The assembler 


2) 


The archiver 


3) 


The linker 


4) 


The object format converter 



These programs and their functionality are described in the following para- 
graphs. 

• The assembler translates assembly language source files into machine 
language object files. Source files can contain instructions, assembler 
directives, and macro directives. Assembler directives can be used to 
control various aspects of the assembly process, such as the source list- 
ing format, data alignment, and section content. 

• The archiver allows collection of a group of files into a single archive 
file. For example, several macros can be collected together into a macro 
library. The assembler will search through the library and use the mem- 
bers that are called as macros by the source file. It is also possible to 

i^ij use the archiver to collect a group of object files into an object library. 

The linker will include the members in the library that resolve external 
references during the link. 
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© The linker combines object files into a single executable object module. 
As it creates the executable module, it performs relocation and resolves 
external references. The linker accepts relocatable object files (created 
by the assembler) as input. It also accepts archive library members and 
output modules created by a previous linker run. Linker directives allow 
combining of file sections, binding of sections or symbols to addresses, 
and defining of global symbols. 

• The main purpose of this development process is to produce a module 
that can be executed in a system that contains a TMS320C30 device 
or the software or hardware development tools. (Note that only 
linked files can be executed). 

O Most EPROM programmers do not accept assembler/linker files as input. 
The object format converter converts the object file into Intel, Tek- 
tronix, or Tl-tagged object format. The converted file can be down- 
loaded to an EPROM programmer. This EPROM code can then be 
executed on the TMS320C30 device. 

Refer to Figure B-1 for a diagram of the development environment when using 
the Assembler/Linker. 

The macro assembler/linker is currently available for PC/MS-DOS, VAX VMS, 
SUN-3 UNIX , and VAX ULTRIX operating systems. 



B.1.2 C Compiler 



The optimizing C compiler is a full implementation of the standard Kernighan 
and Ritchie C. The compiler accepts a digital signal processing program 
written in C language. It outputs TMS320C30 assembly language source 
code which is then processed by the assembler where the TMS320C30 mne- 
monics are converted to object code. 

This high-level language compiler allows time-critical routines written in as- 
sembly language to be called from within the C program. The converse is also 
available; assembly routines may call C functions. The output of the compiler 
can be edited prior to assembly/link to further optimize the performance of the 
code. The compiler supports the insertion of assembly language code into C 
source code. The result is a compiler that allows the relative amounts of 
high-level programming and assembly language code to be tailored according 
to the application. Refer back to Figure B-1 for a diagram of the development 
environment when using the C compiler. 

The compiler is currently available for PC/MS-DOS, VAX VMS, SUN-3 UNIX, 
and VAX ULTRIX operating systems. The assembler/linker is included with the 
shipment of the TMS320C30 C compiler. The output of this assembler/linker 
can be downloaded and used with the simulator, XDS, or PROM programmer. 
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B.1.3 Simulator 




i^H 



The TMS320C30 Simulator is a software program that simulates operation of 
the TMS320C30. 

The following features highlight simulator capability for effective TMS320C30 
software development: 

Simulates the entire TMS320C30 digital signal processor instruction set 

Simulates the key TMS320C30 peripheral features (DMA, timers, and 
serial port) 

Command entry from either menu-driven keystrokes (menu mode) or 

from a batch file (line mode) 

Help menus for all screen modes 

Standard interface can be user customized 

Simulation parameters quickly stored/retrieved from files to facilitate 

preparation for individual sessions 

Reverse assembly allows editing and re-assembly of source statements 

Memory can be displayed (at same time) as: 

- hexadecimal 32-bit values 

- assembled source 

• Execution modes include: 

- single/multiple instruction count 

- single/multiple cycle count 

- until condition is met 

- while condition exists 

- for set loop count 

- unrestricted run with halt by key input 

• Easy to define trace expressions 

• Trace execution with display choices of: 

- designated expression values 

- cache registers 

- instruction pipeline for easy optimization of code 

• Breakpoint conditions include: : 

- address read 

- address write 

- address read or write 

- address execute 

- expression valid 

• Simulates cache utilization 

• Cycle counting 

- display the number of clock cycles in single step or run mode 

- external memory can be configured with wait states for accurate 
cycle counting 

The simulator allows verification and monitoring of the state of the processor. 
Simulation speed is on the order of thousands of instructions per 
second (VAX/VMS, VAX/ULTRIX, and SUN-3 UNIX) or hundreds of in- 
structions per second (PC/MS-DOS). 
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The simulators use TMS320C30 object code, produced by the Macro 
Assembler/Linker. Input and output files may be associated with the port 
addresses of the I/O instructions in order to simulate I/O devices connected 
to the processor. Before initiating program execution, breakpoints may be set, 
and the trace format defined. 

During program execution, the internal registers and memory of the simulated 
TMS320C30 are modified as each instruction is interpreted by the host com- 
puter. Execution is suspended when one of the following conditions exists: 

1 ) A breakpoint or error is encountered. 

2) Execution is halted. 

Once program execution is suspended, the internal registers and both program 
and data memories can be inspected and/or modified. The trace memory can 
also be displayed. A record of the simulation session can be maintained in a 
journal file, so that it can be re-executed to regain the same machine state 
during another simulation session. 

The user interface in the simulator is identical to that in the XDS. See Figure 
B-2 for an example of the user interface. 
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CODE WINDOW 
Shows source code (LO command loads code from <fllename>) 




COMMAND LINE • 
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ENABLED PF KEYS 

DISPLAY WINDOW 
Shows: Banner (DC command) 

Expressions (DE commands) 
Files (DF command) 
Memory (DM command) 
Symbols (DS command) 
(And other displays.) 



REGISTER DISPLAY WINDOWS - 



Figure B-2. TI\/IS320C30 Simulator User Interface 



The simulator is currently available from Tl for PC/MS-DOS, VAX VMS, and 
VAX ULTRIX operating systems. A SUN-3 UNIX version of the simulator can 
be purchased from a third party: Spectron Microsystems Inc. This version is 
the same as Tl's simulator for the PC/MS-DOS, VAX VMS, and VAX ULTRIX. 
Contact Spectron at (805 967-0503) for more information. 
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B.I .4 TI\/IS320C30 Emulator - Extended Development System (XDS1000) 

The TMS320C30 Emulator (XDS1000) is a user-friendly system that has all 
the features necessary for full-speed emulation to debug hardware, software, 
or integrate the software with the hardware. Some of the XDS1 OOO's features 
include: 

O Full-speed execution and monitoring out of the customers target system 
via a 12 pin target connector 

O Software breakpoint 

O Software trace 

O Software timing capabilities 

O Single-step execution 

O Inspect/modify registers and program/data memory 

O Upload/download capabilities to/from data/program memory 

O Windowed user interface similar to the TI\/IS320C30 simulator 

Full-speed execution and monitoring of the customers target system via a 12 
pin target connector has the advantage of using a serial scan path to give ac- 
cess to the internal registers as well as internal and external memory of the 
device. Since execution is out of the TMS320C30 located in the target sys- 
tem, there is no timing difference during emulation. 

Software breakpoints means the program can be stopped on a specific ad- 
dress. When the program counter reaches the designated breakpoint address, 
the emulator will halt execution and allow the user to observe the status of the 
TMS320C30 (i.e., inspect memory or registers). Software trace allows view- 
ing of the TI\/IS320C30's state when a breakpoint is reached. This information 
can be saved to a file for future analysis. Software timing permits keeping 
track of clock ticks between breakpoints or while program single stepping. 

The XDS1000 consists of two full-size PC-XT/ AT cards. One card is the 
TMS320C30 XDS1000 Controller Card, the other is the TMS320C30 
XDS1000 Development Board. 

The TMS320C30 XDS1000 Controller Card is responsible for interpreting 
commands sent from the PC and converting those commands into appropriate 
signal sequences to control the TMS320C30 in the user's target system. 

The TMS320C30 XDS1000 Development Board is a predefined target system 
that contains: 

O A TMS320C30 device 

O 16K X 32-bits full-speed (zero wait state) SRAM on the primary bus 
O Two selectable banks of 8K x 32-bits full-speed (zero wait state) SRAM 
on the expansion bus 

See figure Figure B-3 for a visual representation of the TMS320C30 
XDS1 OOO's development environment. 




fX9 
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TMS320C30 XDS1000 
Controller Card 





User Target System 
□ TMS320C30 



User Memory and I/O 



Hardware/Software 
Development Environment 



TMS320C30 XDS1000 Controller Card 



TMS320C30 XDS1000 
Development Board 



•oiler Card >s 



Memory 




Algorithm Development 
Environment 



Figure B-3. TI\/IS320C30 XDS1000 Development Environment 




This figure shows the two environments in which the TMS320C30 XDS1000 
can operate: 

1 ) The hardware/software development configures the TMS320C30 
XDS1000 and the user's target system in the emulator mode. Section 
13.5 of this document shows the 12-pin header or emulator connector 
necessary for the user's target system to work with the TMS320C30 
XDS1000. 

2) The algorithm development environment allows the user to debug his 
software before the user's target system is built. In this configuration, 
the TMS320C30 XDS1000 Development Board can be used in place of 
the user's target system. In this mode, code can be downloaded into the 
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memory on the TMS320C30 XDS1 000 Development Board and execute 
at full speed. 

To use the TMS320C30 XDS1000, the following equipment is required: 

© IBM PC-XT/ AT compatible 

O Two and one-half eight-bit slots for the PC-AT, three full-size eight-bit 
slots for the PC-XT 

O A minimum of 640K bytes of memory in the PC 

O PC/MS DOS rev 2.0 or later 

In summary, the TI\/IS320C30 XDS1000 is a full-speed emulator that comes 
with a pre built target system for early design development. The TMS320C30 
XDS1000 can help debug hardware in realtime, debug software in realtime, 
and integrate the hardware and software together. 

B.1.5 TMS320 DSP Hotline/Bulletin Board Service 

The TMS320 group at Texas Instruments provides a DSP Hotline to answer 
TMS320 technical questions such as device problems, development tools, 
documentation, upgrades, and new TI\/IS320 products. The hotline is open 
five days a week from 8:00 AM to 6:00 PM Central Time. The phone number 
is (713) 274-2320. For pricing and availability of TMS320 devices and de- 
velopment tools, contact the nearest Tl sales office. To order literature, call the 
Customer Response Center (CRC) at (800) 232-3200. 

The TMS320 DSP Bulletin Board Service is a telephone-line computer bulletin 
board that provides access to information pertaining to TMS320 devices. 
Specification updates for current or new TMS320 devices and development 
tools are communicated via the bulletin board as the information becomes 
available. The Bulletin Board Service can be accessed by dialing (713) 
274-2323 with a 300, 1200, or 2400-bps modem. 

The bulletin board contains TMS320C30 source code from Section 12 of the 
TMS320C30 Users Guide as well as development tool and silicon revisions 
and enhancements. The bulletin board also provides new DSP application 
software as it becomes available. See the TMS320 Family Development Sup- 
port Reference Guide for further information on how to access the bulletin 
board. 
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B.2 TMS320C30 Part Order Information 

This section provides the device and support tool part numbers. Table B-1 
lists the part numbers for the TI\/IS320C30, and Table B-2 gives ordering in- 
formation for TMS320C30 hardware and software support tools. A discussion 
of the TMS320 family device and development support tool prefix designators 
is included to assist in understanding theTMS320 product numbering system. 

Table B-1. TMS320C30 Digital Signal Processor Part Numbers 



DEVICE 


TECHISIOLOGY 


OPERATING 
FREQUENCY 


PACKAGE 
TYPE 


TYPICAL 
DISSIPATION 


tTMX320C30GBH 


I.O-pmCMOS 


33 MHz 


Ceramic 180-pin PGA 


1.5 W 



tMilitary version planned; contact nearest sales office for availability. 



Table B-2. TMS320C30 Support Tool Part Numbers 



TOOL DESCRIPTION 


OPERATING SYSTEM 


PART NUMBER 


SOFTWARE 


Macro Assembler/Linker 


VAX VMS 


TMDX3243250-08 




PC/MS-DOS 


TMDX3243850-02 




SUN-3 UNIX * 


TMDX3243550-08 




VAX ULTRIX 


TMDX3243260-08 


C Compiler & Macro Assembler/ 


VAX VMS 


TMDX3243255-08 


Linker 


PC/MS DOS 


TMDX3243855-02 




SUN-3 UNIX * 


TMDX3243555-08 




VAX ULTRIX 


TMDX3243265-08 


Simulator 


VAX VMS 


TMDX3243251-08 




PC/MS-DOS 


TMDX3243851-02 




SUN-3 UNIX * 


Offered by 
Spectron Inc. 
(805) 967-0503 




VAX ULTRIX 


TMDX3243261-08 


HARDWARE 


XDS1000 


PC/MS-DOS 


TMDX3261030 



• Please note SUN UNIX support for TMS320C30 software tools is for the 68000 family 
based SUN-3 series workstations. These tools are NOT SUPPORTED on the SUN-4 
series machines that use the SPARC processor, or the SUN-386i series of workstations. 
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B.2.1 Device and Development Support Tool Prefix Designators 

To assist the user in understanding the stages in the product development 
cycle, Texas Instruments assigns prefix designators in the part number no- 
menclature. A device prefix designator has three options: TMX, TMP, and 
TMS, and a development support tool prefix designator has two options: 
TMDX and TMDS. These prefixes are representative of the evolutionary stages 
of product development from engineering prototypes (TMX/TMDX) through 
fully qualified production devices (TMS/TMDS). This development flow is 
defined below. 

Device Development Evolutionary Flow: 

TMX Experimental device that is not necessarily representative of the final 
device's electrical specifications. 

TMP Final silicon die that conforms to the device's electrical specifications 
but has not completed quality and reliability verification. 

TMS Fully qualified production device. 

Support Tool Development Evolutionary Flow: 

TMDX Development support product that has not yet completed Texas In- 
struments internal qualification testing. 

TMDS Fully qualified development support product. 



TMX and TMP devices and TMDX development support tools are shipped 
with the following disclaimer: 

"Developmental product is intended for internal evaluation purposes." 



Note: 

Texas Instruments recommends that prototype devices (TMX or TMP) not 
be used in production systems since their expected end-use failure rate is 
undefined but predicted to be greater than standard qualified production 
devices. 



TMS devices and TMDS development support tools have been fully charac- 
terized and the quality and reliability of the device has been fully demon- 
strated. Texas Instruments standard warranty applies. 
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B.2.2 Device Nomenclature 



In addition to the prefix, the device nomenclature includes a suffix that follows 
the device family name. This suffix indicates the package type (e.g., N, FN, 
or GB) and temperature range (e.g., L). Figure B-4 provides a legend for 
reading the complete device name for any TMS320 family member. 



TMX 320 C 30 GB H 



PREFIX 

TMX = experimental device 
TMP = prototype device 
TMS = qualified device 
SMJ = MIL-STD-883C 



DEVICE FAMILY 

320 = TMS320 family 



TECHNOLOGY 

C = CMOS 

E = CMOS EPROM 

No letter = NMOS 



DEVICE 

Ist-gen. DSP: 

10 

15 

17 
2nd-gen. DSP: 

20 

25 
3rd-gen. DSP: 

30 



L 



TEMPERATURE RANGE 

H = to 50''C 
L = to 70°C 
S = -55 to 1 0OX 
M = -55 to 125°C 
A = -40 to 85°C 

PACKAGE TYPE 

N = plastic DIP 
JD = ceramic DIP 
side-brazed 
FN = plastic leaded CC 
GB = ceramic PGA 
FJ = ceramic leaded CC 
FD = leadless ceramic CC 



ri^Bi 



Figure B-4. TMS320 Device Nomenclature 
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The opcode fields for all the TMS320C30 instructions are shown in Table C-1 . 
Bits in the table marked with a hyphen are defined in the individual instruction 
description (see Section 11). Table C-1 along with the instruction de- 
scriptions fully define the instruction words. The opcodes are listed in nu- 
merical order. 



Table C-1. TI\/IS320C30 Instruction Opcodes 



INSTRUCTION 


31 


30 


29 


28 


27 


26 


25 


24 


23 


ABSF 





























ABSi 


























1 


ADDC 























1 





ADDF 























1 


1 


ADDI 




















1 








AND 




















1 





1 


ANDN 




















1 


1 





ASH 




















1 


1 


1 


CMPF 




























CMPI 

























1 


FIX 






















1 





FLOAT 






















1 


1 


IDLE 



















1 








LDE 



















1 





1 


LDF 



















1 


1 





LDFI 



















1 


1 


1 


LDI 




























LDII 

























1 


LDM 






















1 





LSH 






















1 


1 


MPYF 



















1 








MPYI 



















1 





1 


NEGB 



















1 


1 





NEGF 



















1 


1 


1 


NEGI 
















1 











NOP 
















1 








1 
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Table C-1. TMS320C30 Instruction Opcodes (Continued) 



INSTRUCTION 


31 


30 


29 


28 


27 


26 


25 


24 


23 


NORM 





















1 





NOT 





















1 


1 


POP 


















1 








POPF 


















1 





1 


PUSH 


















1 


1 





PUSHF 


















1 


1 


1 


OR 




























RND 






















1 





ROL 






















1 


1 


ROLC 



















1 








ROR 



















1 





1 


RORC 



















1 


1 





RPTS 



















1 


1 


1 


STF 



























STFI 
























1 


STI 





















1 





STII 





















1 


1 


SIG! 


















1 








SUBB 


















1 





1 


SUBC 


















1 


1 





SUBF 


















1 


1 


1 


SUBI 



























SUBRB 



























SUBRF 





















1 





SUBRI 





















1 


1 


TSTB 


















1 








XOR 


















1 





1 


lACK 


















1 


1 





ADDC3 




























ADDF3 

























1 


ADDIS 






















1 





AND3 






















1 


1 


ANDN3 



















1 








ASH3 



















1 





1 


CMPF3 



















1 


1 





CMPI3 



















1 


1 


1 


LSH3 
















1 
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Table C-1. TMS320C30 Instruction Opcodes (Continued) 



INSTRUCTION 


31 


30 


29 


28 


27 


26 


25 


24 


23 


MPYF3 
























1 


MPYI3 





















1 





0R3 





















1 


1 


SUBB3 


















1 








SUBF3 


















1 





1 


SUBI3 


















1 


1 





TSTB3 


















1 


1 


1 


X0R3 













1 














LDFcond 













- 


- 


- 


- 


- 


LDlcond 










1 


- 


- 


- 


- 


- 


BR(D)t 





















- 


- 


CALL 


















1 


- 


- 


RPTB 















1 





- 


- 


SWI 















1 


1 


- 


- 


Bconc/(D)t 












1 





- 


- 


- 


DBcond{D}t 












1 


1 


- 


- 


- 


CALLcond 









1 








- 


- 


- 


TRAPcond 









1 





1 





- 


- 


RETlcond 









1 


1 














RETScond 









1 


1 











1 


MPYF3IIADDF3 





































1 
1 




1 



1 


- 


MPYF3IISUBF3 




























1 
1 
1 
1 





1 
1 




1 



1 


- 


MPYI3IIADDI3 

































1 
1 




1 



1 


- 


MPYI3IISUBI3 

























1 
1 
1 
1 





1 
1 




1 



1 


- 


STFIISTF 




1 

















- 




STIIISTI 




1 














1 


- 




LDFIILDF 




1 











1 





- 


- 



"f Opcode same for standard and delayed instructions. 
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Table C-1. TMS320C30 Instruction Opcodes (Concluded) 



INSTRUCTION 


31 


30 


29 


28 


27 


26 


25 


24 


23 


LDIIILDI 















1 


1 


- 


- 


ABSFIISTF 












1 








- 


- 


ABSIIISTI 












1 





1 


- 


- 


ADDF3IISTF 












1 


1 





- 


- 


ADDI3IISTI 












1 


1 


1 


- 


- 


AND3IISTI 




















- 


- 


ASH3IISTI 

















1 


- 


- 


FIXIISTI 














1 





- 


- 


FLOATIISTF 














1 


1 


- 


- 


LDFIISTF 











1 








- 


- 


LDIIISTI 











1 





1 


- 


- 


LSH3IISTI 











1 


1 





- 


- 


MPYF3IISTF 











1 


1 


1 


- 


- 


MPYI3IISTI 




















- 


- 


NEGFIISTF 

















1 


- 


- 


NEGIIISTI 














1 





- 


- 


NOTIISTI 














1 


1 


- 


- 


0R3IISTI 











1 








- 


- 


SUBF3IISTF 











1 





1 


- 


- 


SUBI31ISTI 











1 


1 





- 


- 


X0R3IISTI 











1 


1 


1 


- 


- 


Reserved for reset, 
traps, and interrupts 





1 


1 


1 


1 


1 


1 


1 


1 
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The quality and reliability performance of Texas Instruments Microprocessor 
and Microcontroller Products, which includes the three generations of 
TMS320 digital signal processors, relies on feedback from: 

• Our customers 

• Our total manufacturing operation from front-end wafer fabrication to 
final shipping inspection 

• Product quality and reliability monitoring. 

Our customer's perception of quality must be the governing criterion for 
judging performance. This concept is the basis for Texas Instruments Corpo- 
rate Quality Policy, which is as follows: 

"For every product or service we offer, we shall define the require- 
ments that solve the customer's problems, and we shall conform to 
those requirements without exception." 

Texas Instruments offers a leadership reliability qualification system, based on 
years of experience with leading-edge memory technology as well as years of 
research into customer requirements. Quality and reliability programs at Tl are 
therefore based on customer input and internal information to achieve con- 
stant improvement in quality and reliability. 
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D.1 Reliability Stress Tests 



Accelerated stress tests are performed on new semiconductor products and 
process changes to ensure product reliability excellence. The typical test en- 
vironments used to qualify new products or major changes in processing are: 

High-temperature operating life 

Storage life 

Temperature cycling 

Biased humidity 

Autoclave 

Electrostatic discharge 

Package integrity 

Electromigration 

Channel-hot electrons (performed on geometries less than 2.0 \irr\). 

ypical events or changes that require internal requalification of product in- 
clude: 

New die design, shrink, or layout 

Wafer process (baseline/control systems, flow, mask, chemicals, gases, 

dopants, passivation, or metal systems) 

Packaging assembly (baseline control systems or critical assembly 
equipment) 

Piece parts (such as lead frame, mold compound, mount material, bond 
wire, or lead finish) 

• Manufacturing site. 

Tl reliability control systems extend beyond qualification. Total reliability 
controls and management include product reliabily monitor as well as final 
product release controls. MOS memories, utilizing high-density active ele- 
ments, serve as the leading indicator in wafer-process integrity at Tl MOS fa- 
brication sites, enhancing all MOS logic device yields and reliability. Tl places 
more than several thousand MOS devices per month on reliability test to en- 
sure and sustain built-in product excellence. 

Table D-1 lists the microprocessor and microcontroller reliability tests, the 
duration of the test, and sample size. The following defines and describes 
those tests in the table. 

ACQ (Average Outgoing Quality) Amount of defective product in a pop- 
ulation, usually expressed in terms of 
parts per million (PPM). 



FIT (Failure In Time) 



Operating lifetest 



Estimated field failure rate in number 
of failures per billion power-on device 
hours; 1000 FITS equals 0.1 percent 
fail per 1000 device hours. 

Device dynamically exercised at a high 
ambient temperature (usually 125°C) 
to simulate field usuage that would 
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High-temperature storage 



Biased humidity 



Autoclave (pressure cooker) 



Temperature cycle 



Thermal shock 



FIND 



expose the device to a much lower 
ambient temperature (such as 55°C). 
Using a derived high temperature, a 
55°C ambient failure rate can be cal- 
culated. 

Device exposed to 150°C unbiased 
condition. Bond integrity is stressed in 
this environment. 

Moisture and bias used to accelerate 
corrosion-type failures in plastic 
packages. Conditions include 85°C 
ambient temperature with 85-percent 
relative humidity (RH). Typical bias 
voltage is +5 V and ground on alter- 
nating pins. 

Plastic-packaged devices exposed to 
moisture at 121°C using a pressure of 
one atmosphere above normal pres- 
sure. The pressure forces moisture 
permeation of the package and accel- 
erates corrosion mechanisms (if pres- 
ent) on the device. External package 
contaminates can also be activated 
and caused to generate inter-pin cur- 
rent leakage paths. 

Device exposed to severe temperature 
extremes in an alternating fashion 
(-65°C for 15 minutes and 150''C for 
1 5 minutes per cycle) for at least 1 000 
cycles. Package strength, bond qual- 
ity, and consistency of assembly pro- 
cess are stressed in this environment. 

Test similar to the temperature cycle 
test, but involving a liquid-to-liquid 
transfer, per MIL-STD-883C, Method 
1011. 

Particle Impact Noise Detection test. 
A non-destructive test to detect loose 
particles inside a device cavity. 



Mechanical Sequence: 

Fine and gross leak 
Mechanical shock 

PIND (optional) 

Vibration, variable frequency 

Constant acceleration 

Fine and gross leak 



Per MIL-STD-883C, Method 1014.5 

Per MIL-STD-883C, Method 2002.3, 

1500 g, 0.5 ms. Condition B 

Per MIL-STD-883C, Method 2020.4 

Per MIL-STD-883C, Method 2007.1, 

20 g. Condition A 

Per MIL-STD-883C, Method 2001.2, 

20 kg. Condition D, Y1 Plane min 

Per MIL-STD-883C, Method 1014.5 
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Electrical test 

Thermal Sequence: 

Fine and gross leak 
Solder heat (optional) 
Temperature cycle 

(10 cycles minimum) 
Thermal shock 

(10 cycles minimum) 
Moisture resistance 
Fine and gross leak 
Electrical test 

Thermal/Mechanical Sequence: 

Fine and gross leak 
Temperature cycle 
(10 cycles minimum) 
Constant acceleration 

Fine and gross leak 
Electrical test 

Electrostatic discharge 
Solderability 
Solder heat 

Salt atmosphere 

Lead pull 

Lead integrity 

Electromigration 

Resistance to solvents 



To data sheet limits 



Per MIL-STD-883C, Method 1014.5 

Per MIL-STD-750C, Method 1014.5 

Per MIL-STD-883C, Method 1010.5, 

-65 to +150°C, Condition C 

Per MIL-STD-883C, Method 1011.4, 

-55 to +125°C, Condition B 

Per MIL-STD-883C, Method 1004.4 

Per MIL-STD-883C, Method 1014.5 

To data sheet limits 



Per MIL-STD-883C, Method 1014.5 

Per MIL-STD-883C, Method 1010.5, 

-65 to +150°C, Condition C 

Per MIL-STD-883C, Method 2001.2, 

30 kg, Y1 Plane 

Per MIL-STD-883C, Method 1014.5 

To data sheet limits 

Per MIL-STD-883C, Method 3015 
Per MIL-STD-883C, Method 2003.3 
Per MIL-STD-750C, Method 2031, 
10 sec 

Per MIL-STD-883C, Method 1009.4, 
Condition A, 24 hrs min 
Per MIL-STD-883C, Method 2004.4, 
Condition A 

Per MIL-STD-883C, Method 2004.4, 
Condition B1 

Accelerated stress testing of con- 
ductor patterns to ensure acceptable 
lifetime of power-on operation 
Per MIL-STD-883C, Method 2015.4 
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Reliability Stress Tests 



Table D-1. Microprocessor and Microcontroller Tests 



TEST 


DURATION 


SAMPLE SIZE 1 






PLASTIC 


CERAMIC 


Operating life, 125°C, 5.0 V 


1000 hrs 


129 


129 


Operating life, 150°C, 5.0 V 


1000 hrs 


77* 


77 


Storage life, 150°C 


1000 hrs 


77 


77 


Biased 85°C/85 percent RH, 5.0 V 


1000 hrs 


129 


- 


Autoclave, 12rC, 1 ATM 


240 hrs 


77 


- 


Temperature cycle, -65 to 1 50°C 


1 000 cyct 


129 


129 


Temperature cycle, to 1 25°C 


3000 eye 


129 


129 


Thermal shock, -65 to 150°C 


200 eye 


129 


129 


Electrostatic discharge, ±2 kV 




12 


12 


Latch-up (CMOS devices only) 




5 


5 


Mechanical sequence 




- 


38 


Thermal sequence 




- 


38 


Thermal/mechanical sequence 




- 


38 


FIND 




- 


45 


Internal water vapor 




- 


3 


Solderability 




22 


22 


Solder heat 




22 


22 


Resistance to solvents 




15 


15 


Lead integrity 




15 


15 


Lead pull 




22 


- 


Lead finish adhesion 




15 


15 


Salt atmosphere 




15 


15 


Flammability (UL94-V0) 




3 


- 


Thermal impedance 




5 


5 



*lf junction temperature does not exceed plasticity of package, 
tpor severe environments; reduced cycles for office environments. 

Table D-2 lists the TMS320C30 device, the approximate number of transis- 
tors, and the equivalent gates. The numbers have been determined from design 
verification runs. 

Table D-2. TMS320C30 Transistors 



DEVICE 


# TRANSISTORS 


# GATES 


CMOS: TMS320C30 


600K-700K 


200K 



Tl Qualification test updates are available upon request at no charge. Tl will 
consider performing any additional reliability test(s), if requested. For more 
information on Tl quality and reliability programs, contact the nearest Tl field 
sales office. 
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Reliability Stress Tests 



Note: 

Texas Instruments reserves the right to make changes in MOS Semicon- 
ductor test limits, procedures, or processing without notice. Unless prior 
arrangements for notification have been made, Tl advises all customers to 
reverify current test and manufacturing conditions prior to relying on 
published data. 
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PENNSYLVANIA: Blu* Ball (215) 825-9500. 

PUERTO RICO: Halo Ray (809) 753-8700. 

TENNESSEE: Johnson CHy (615) 461-2192. 

TEXAS: Austin (512) 250^769: 

Houston (713) 778-6592: Richardson (214) 680-5082: 

San Antonio ^12) 496-1779. 

UTAH: Murray (801) 266-8972. 

VIRGINIA: Fairfax (703) 849-1400. 

WASHINGTON: Radmond (206) 881-3080. 

WISCONSIN: Brookflak) (414) 782-2899. 



TI Regional 
Technology Centers 

CALIFORNIA: Irvlna (714) 660-6140: 
Santa Clara (408) 748-2220: 
Torranca (213) 217-7019. 

COLORADO: Aurora (303) 368-6000. 

GEORGIA: Norcrosa (404) 662-7945. 

laiNOIS Arlington HalghU (313) 640-2909. 

MASSACHUSETTS: Waltham (617) 895-9196. 

TEXAS: Richardson (214) 680-5066. 

CANADA: Nepaan, Ontario (613) 726-1970. 



TI Distributors 



TI AUTHORIZED DISTRIBUTORS 
Arrow/Klerulff Electronics Group 
Arrow Canada (Canada) 
Future Electronics (Canada) 
GRS Electronics Co., Inc. 
Hall-Mark Electronics 
Marshall Industries 
Newark Electronics 
Schweber Electronics 
Time Electronics 
Wyle Laboratories 
Zeus Components 

-OBSOLETE PRODUCT ONLY- 
Rochester Electronics, Inc. 
Newburyport, Massachusetts 
(617) 462-9332 



ALABAMA: Arrow/Kieruitt (205) 837-6955: 
Hall-Mark (205) 837-8700: Marshall (205) 881-9235: 
Schweber (205) 895-0480. 

ARIZONA: Arrow/KierulH (602) 437-0750: 
Hall-Mark (602) 437-1200: Marshall (602) 496-0290: 
Schweber (602) 997-4874: V^s (602) 666-2888. 

CALIFORNIA: l-oa Angties/Oranga County: 

Arrow/KleruKt (816) 701-7500, (714) 838-5422: 

Hall-Mark (818) 716-7300, (714) 669-4100, 

(213) 217-8400: Marshall (818) 407-0101. (818) 459-5500, 

(714 458-5395: Schweber (818) 999-4702: 

mt] 863-0200. (213) 320-8090: Wyle (213) 322-9953. 

(818) 880-9000, (714) 863-9953: Zeus (714) 921-9000: 

Sacramento: Hall-Mark (916) 722-6600: 

Marshall (916) 635-9700: Schweber (916) 929-9732: 

Wyle (916) 638-5282: 

San Diego: Arrow/Kierulfl (619) 565-4800: 

Hall-Mark (619) 266-1201: Marshall (619) 578-9600: 

Schweber (61 9) 450-0454: Wyle (61 9) 565-91 71 : 

San Francisco Bay Area: Arrow/KierulH (408) 745-6600, 

Hall-Mark (408) 432-0900: Marshall (408) 942-4600; 

Schweber (408) 432-7171; Wyle (408) 727-2500; 

Zeus (408) 998-5121. 

COLORADO: Arrow/Kierulfl (303) 790-4444; 
Hall-Mark (303) 790-1662: Marshall (303) 451-6383: 
Schweber (303) 799-0258: Wyle (303) 457-9953. 

CONNETICUT: Arrow/Kierulfl (203) 265-7741; 
Hall-Mark (203) 269-0100: Marshall (203) 265-3822; 
Schweber (203) 748-7080. 

FLORIDA: Ft Uuderdala: 

Arrow/Kierulfl (305) 429-6200; Hall-Mark (305) 971-9280; 
Marshall (305) 977-4880: Schweber (305) 977-7511; 
Orlando: Arrow/KJerulfl (305) 725-1480, (305) 682-6923; 
Hall-Mark (305) 855^020: Marshall (305) 767-6585; 
Schweber (305) 331-7555: Zeus (305) 365-3000; 
Tampa: Hall-Mark (813) 530-4543; 
Marshall (813) 576-1399. 

GEORGIA: Arrow/KieruM (404) 449-8252; 
Hall-Mark (404) 447-6000: Marshall (404) 923-5750: 
Schweber (404) 449-9170. 

ILUNOIS: Arrow/Kierulfl (31 2) 250-0500: 
Hall-Mark (312) 860-3800: Marshall (312) 490-0155; 
Newark (312) 784-5100; Schweber (312) 364-3750. 



KANSAS: Kansas CHy: Arrow/Kierulfl (913) 541-9542; 
Hall-Mark (913) 886-4747: Marshall (913) 492-3121: 
Schweber (913) 492-2922. 

MARYLAND: Arrow/Kierulfl (301) 995-6002: 
Hall-Mark (301) 988-9800; Marshall (301) 840-9450: 
Schweber (301) 840-5900: Zeus (.T>ii 997-1118. 



Texas 
Instruments 



MASSACHUSETTS Anow/Kjerulff (617) 935-5134; 
Hall-Mark (617) 667-0902; Marshall (617) 658-0810; 
Schweber (617) 275-5100. (617) 657-0760: 
Time (617) 532-6200; Zeus (617) 863-8800. 

MICHIGAN: Detroit: Arrow/Kierulfl (313) 971-8220: 
Marshall (313) 525-5350; Newark (313) 967-0600; 
Schweber (313) 525-8100: 
Grand Rapids: Arrow/Kierulfl (616) 243-0912. 

MINNESOTA: Arrow/Kerutfl (612) 830-1600: 
Hall-Mark (612) 941-2600: Marshall (612) 559-221 1; 
Schweber (612) 9415230. 

MISSOURI: St. Louis: Arrow/Kierulfl (314) 567-6688; 
Hall-Mark (314) 291-S3J0: Marshall (314) 291-4650; 
Schweber (314) 739-0526. 



NEW JERSEY: Arrow/Kierulfl (201) 538-0900, 
(609) S96-800O; GRS Electronk:s (609) 964-8560; 
Hall-Mark (201) 575-4415, (609) 235-1900; 
Marshall (201) 882-0320, (609) 234-9100; 
Schweber (201) 227-7880. 

NEW MEXICO: Arrow/Kierulfl (505) 243-4566. 

NEW YORK: Una Island: 

Arrow/Kierulfl (516) 231-1000: Hall-Mark (516) 737-0600: 

Marshall (516) 273-i424; Schweber (516) 334-7555, 

Zeus (914) 937-7400; 

Rochester: Arrow/Kierulfl (716) 427-0300; 

Hall-Mark (716) 2449290: Marshall (716) 235-7620; 

Schweber (716) 424-2222; 

Syracuse: Marshall(6a7) 798-1611. 

NORTH CAROUNA: /krrow/Kierulfl (919) 876-3132, 
(919) 725-8711; Hall-Mark (919) 872-0712: 
Marshall (919) 878-9382; Schweber (919) 876-0000. 

OHIO: Cleveland: /Vrow/Kisrull((216) 248-3990; 

Hall-Mark (216) 3494632; Marshall (216) 246-1788: 

Schweber (216) 464-2970; 

Columbus: Arrow/Klerum (61 4) 436-0928: 

Hall-Mark (614) 888.3313; 

Dayton: Arrow/Kierulfl (51 3) 435-5563: 

Marshall (513) 698-4480; Schweber (513) 439-1800. 



PENNSYLVANIA: /krrow/Kierulfl (412) 656-7000, 
(215) 928-1800: GRS Electronics (215) 922-7037; 
Schweber (215) 441-0600, (412) 963-6804. 

TEXAS: Austin: Airow/Kieruin (512) 835-4180: 
Hall-Mark (512) 258-8648: Marshall (512) 837-1991; 
Schweber (512) 339-0088; Wyle (512) 834-9957; 
Dallas: Arrow/Kierulfl (214) 380-6464: 
Hall-Mark (214) 553-43O0: Marshall (214) 233-5200: 
Schweber (214) 661-6010: Wyle (214) 235-9953: 
Zeus (214) 783-7010: 
Houston: Arrow/Kltrulfl (713) 530-4700; 
Hall-Mark (713) 781-6100: Marshall (713) 895-9200: 
Schweber (713) 784-3600; Wyle (713) 879-9953. 

UTAH: Arrow/Kierulfl (801) 973.6913: 

Hall-Mark (801) 972-1008; Marshall (801) 485-1551; 

Wyle (801) 974-9953. 



WISCONSIN: Arrow/Kierulfl (414)7920150: 
Hall-Mark (414) 797-7844; Marshall (414) 797-8400; 
Schweber (414) 784-9020. 

CANADA: Calgsry: Future (403) 235-5325: 

Edmonton: Futura (403) 438-2858: 

Montreal: Arrow Canada (514) 735-5511; 

Future (514) 694-7710; 

Ottawa: Arrow Canada (613) 226-6903: 

Future (613) 620-8313: 

Ouebee City: Arrow Canada (41 8) 687-4231 : 

Toronto: Arrow Canada (416) 672-7769; 

Future (416) 638-4771: 

Vancouver: Futura (604) 294-1166: 

vnnnlpeg: Future (204) 339-0554. 

Customer 
Response Center 

TOLL FREE: (800) 232-3200 



Expires December 31 , 1 989 SPR39IPR800R 

TMS320 Literature Request Card 

Please send me literature regarding the following TI\/IS320 DSP areas: 

PR01 [] DSP Applications 

PR02 [] DSP Development Support 

PROS [] DSP University Program 

PR04 [] First-Generation TMS320 

PR05 [] Second-Generation TMS320 

PR30 [] Please add me to your mailing list. 

Name 

Company ^Title 

Address 



City/State/Zip Telephone 



Reader Response Card September 1988 

Third-Generation TMS320 User's Guide 

Please use this form to communicate your comments about this document, its organization and subject 
matter, for the purpose of Improving technical documentation. Thank you for your cooperation. 

1 ) What do you feel are the best features of this document? 



2) How does this document meet your digital signal processing needs? 



3) Do you find the organization of this document easy to follow? If not, why? 



4) What additions do you think would enhance the structure and subject matter? 



5) What deletions could be made without affecting overall usefulness? 



6) Is there any incorrect or misleading information? . 



7) How would you improve this document? 



Name 

Company ^Title 

Address 



City/State/Zip ^Telephone 

SPRU014 



PLACE 

STAMP 

HERE 



Literature Response Center 
Texas Instruments Incorporated 
P.O. Box 809066 
Dallas, TX 75380-9990 



PLACE 

STAMP 

HERE 



Technical Publications Manager 
Texas Instruments Incorporated 
P.O. Box 1443, MS 640 
Houston, TX 77001 



im i ffirwaTw^vjiia.^ MJBt. jj>j^jwu,jB»„i i t I. 







August 1988 
Printed in U.S.A. 



^ 
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